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Description 

FIELD OF THE INVENTION 

5 [0001] The present Invention relates to polynucleotides encoding novel polypeptides, polypeptides encoded by the 
polynucleotides, and new uses of these. 

BACKGROUND OF THE INVENTION 

10 [0002] Currently, the sequencing projects, the determination and analysis of the genomic DNA of various living or- 
ganisms have been in progress all over the world. The whole genomic sequences of more than 40 species of prokary- 
otes, a lower eukaryote, yeast, a multicellular eukaryote, C. elegans, and a higher plants, arabidopsis, etc. are already 
determined. For human genome, presumably having 3 billion base pairs, the analysis was advanced under global 
cooperative organization, and a draft sequence was disclosed in 2001 . Moreover, all the structures are to be clear and 

15 to be disclosed in 2002 - 2003. The aim of the determination of genomic sequence is to reveal the functions of all genes 
and their regulation and to understand living organisms as a network of interactions between genes, proteins, cells or 
individuals through deducing the information in a genome, which is a blueprint of the highly complicated living organ- 
isms. To understand living organisms by utilizing the genomic information from various species is not only important 
as an academic subject, but also socially significant from the viewpoint of industrial application. 

20 [0003] However, determination of genomic sequences itself cannot identify the functions of all genes. For example, 
as for yeast, only the function of approximately half of the 6000 genes, which, is predicted based on the genomic 
sequence, was able to be deduced. On the other hand, the human genome has been estimated to contain about 
30,000-40,000 genes. Further, 1 00,000 or more types of mRNAs are said to exist when variants produced by alternative 
splicing are taken into consideration. Therefore, it is desirable to establish "a high throughput analysis system of the 

25 gene functions" which allows us to identify rapidly and efficiently the functions of vast amounts of the genes obtained 
by the genomic sequencing. 

[0004] Many genes in the eukaryotic genome are split by introns into multiple exons. Thus, it is difficult to predict 
correctly the structure of encoded protein solely based on genomic information. In contrast, cDNA, which is produced 
from mRNA that lacks introns, encodes a protein as a single continuous amino acid sequence and allows us to identify 

30 the primary structure of the protein easily. In human cDNA research, to date, more than three million ESTs (Expression 
Sequence Tags) are publicly available, and the ESTs presumably cover not less than 80% of all human genes. 
[0005] The information of ESTs is utilized for analyzing the structure of human genome, or for predicting the exon- 
regions of genomic sequences or their expression profile. However, many human ESTs have been derived from prox- 
imal regions to the 3'-end of cDNA, and information around the 5'-end of mRNA is extremely little. Among human 

35 cDNAs, the number of the corresponding mRNAs whose encoding full-length protein sequences are deduced is ap- 
proximately 13,000. 

[0006] It is possible to identify the transcription start site of mRNA on the genomic sequence based on the 5'-end 
sequence of a full-length cDNA, and to analyze factors involved in the stability of mRNA that is contained in the cDNA, 
or in its regulation of expression at the translation stage. Also, since a full-length cDNA contains atg codon, the trans- 

40 lation start site, in the 5'-region, it can be translated into a protein in a correct frame. Therefore, it is possible to produce 
a large amount of the protein encoded by the cDNA or to analyze biological activity of the expressed protein by utilizing 
an appropriate expression system. Thus, analysis of a full-length cDNA provides valuable information which comple- 
ments the information from genome sequencing. Also, full-length cDNA clones that can be expressed are extremely 
valuable in empirical analysis of gene function and in industrial application. 

45 [0007] Therefore, if a novel human full-length cDNA is isolated, it can be used for developing medicines for diseases 
in which the gene is involved. The protein encoded by the gene can be used as a drug by itself. Thus, it has great 
significance to obtain a full-length cDNA encoding a novel human protein. 

[0008] In particular, human secretory proteins or membrane proteins would be useful by itself as a medicine like 
tissue plasminogen activator (TPA), or as a target of medicines like membrane receptors. In addition, genes for signal 

so transduction-related proteins (protein kinases, etc.), glycoprotein-related proteins, transcription-related proteins, etc. 
are genes whose relationships to human diseases have been elucidated. Moreover, genes for disease-related proteins 
form a gene group rich in genes whose relationships to human diseases have been elucidated. 
[0009] Therefore, it has great significance to isolate novel full-length cDNA clones of human, only few of which has 
been isolated. Especially, isolation of a novel cDNA clone encoding a secretory protein or membrane protein is desired 

55 since the protein itself would be useful as a medicine, and also the clones potentially include a gene involved in diseases. 
In addition, genes encoding proteins that are involved in signal transduction, glycoprotein, transcription, or diseases 
are expected to be useful as target molecules for therapy, or as medicines themselves. These genes form a gene 
group predicted to be strongly involved in diseases. Thus, identification of the full-length cDNA clones encoding those 
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proteins has great significance. 

SUMMARY OF THE INVENTION 

5 [001 0] An objective of the present invention is to provide polynucleotides encoding novel polypeptides, polypeptides 
encoded by the polynucleotides, and novel usages of these. 

[001 1] The inventors have developed a method for efficiently cloning, from a cDNA library having very high fullness- 
ratio, a human full-length cDNA that is predicted to be a full-length cDNA clone, where the cDNA library is synthesized 
by an improved method (WO 01/04286) of the oligo-capping method (K. Maruyama and S. Sugano, Gene, 1 38: 1 71 -1 74 
w (1994); Y. Suzuki et al., Gene, 200: 149-156 (1997)). Then, the nucleotide sequences of cDNA clones whose fullness 
ratio is high, obtained by this method, were determined mainly from their 5'-ends, and, if required, from 3'-ends. 
[001 2] Further, representative clones, which were estimated to be novel and full-length, among the clones obtained, 
were analyzed for their full-length nucleotide sequences. The determined full-length nucleotide sequences were ana- 
lyzed by BLAST homology search of the databases shown below. Because the homology search of the present inven- 
ts tion is carried out based on the information of full-length cDNAs including the entire coding regions, homology to every 
part of a polypeptide can be analyzed. Thus, in the present invention, the reliability of homology search has been 
greatly improved. 

[1] SwissProt 

20 (http://www.ebi.ac.uk/ebi_docsSwissProt_db/swisshome.html), 
[2] GenBank (http://www.ncbi.nlm.nih.gov/web/GenBank), 
[3] UniGene (Human) (http://www.ncbi.nlm.nih.gov/UniGene), and 

[4] nr (a protein database, which has been constructed by combining data of coding sequences (CDS) in nucleotide 
sequences deposited in GenBank, and data of SwissProt, PDB (http://www.rcsb.org/pdb/index.html), PIR (http:// 
25 pir.georgetown.edu/pirwww/pirhome.shtml), and PRF (http://www.prf .or.jp/en/); overlapping sequences have been 

removed.) 

[001 3] Further, the gene expression profiles of cDN A clones whose fu ll-length nucleotide sequence had been deter- 
mined were studied by analyzing the large-scale cDNA database constructed based on the 5'-end nucleotide sequenc- 
30 es of cDNAs obtained. In addition to the analysis for the expression profile by computer, the profiles of gene expression 
in living cells were also determined by PCR. The present inventors revealed the usefulness of the genes of the present 
invention based on these analysis results. 

[0014] In the present invention, gene functions were revealed by the analysis of expression profiles in silico based 
on the information of full-length nucleotide sequences. The expression profiles used in the expression frequency anal- 

35 ysis were studied based on the database containing sufficient amount of fragment sequence data. The expression 
frequency analysis was carried out by referring, for these expression profiles, to the full-length nucleotide sequences 
of many cDNA clones obtained in the present invention. Thus, a highly reliable analysis can be achieved by referring 
to the full-length nucleotide sequences of a wide variety of genes for the sufficiently large population for analysis (ex- 
pression profiles). Namely, the results of expression frequency analysis using the full-length sequences of the present 

40 invention more precisely reflect the gene expression frequency in tissues and cells from which a certain cDNA library 
was derived. In other words, the information of full-length cDNA nucleotide sequence of the present invention made it 
possible to achieve the highly reliable expression frequency analysis. 

[0015] The full-length cDNA clones of this invention were obtained by the method comprising the steps of [1] preparing 
libraries containing cDNAs with the high fullness ratio by oligo-capping, and [2] assembling 5'-end sequences and 

45 selecting one with the highest probability of completeness in length- in the cluster formed (there are many clones longer 
in the 5'-end direction). However, the uses of primers designed based on the 5'- and 3'-end sequences of polynucle- 
otides provided by the present invention enable readily obtaining full-length cDNAs without such a special technique. 
The primer, which is designed to be used for obtaining cDNAs capable of being expressed, is not limited to the 5'- and 
3' -end sequences of polynucleotide. 

so [0016] Specifically, the present invention relates to a polynucleotide selected from the group consisting of the fol- 
lowing (a) to (g): 

(a) a polynucleotide comprising a protein-coding region of the nucleotide sequence of any one of SEQ ID NOs 
shown in Table 1 ; 

55 (b) a polynucleotide encoding a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs shown 

in Table 1 ; 

(c) a polynucleotide comprising a nucleotide sequence encoding a polypeptide comprising the amino acid se- 
quence of any one of SEQ ID NOs shown in Table 1, wherein, in said amino acid sequence, one or more amino 
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acids have been substituted, deleted, inserted, and/or added, and wherein said nucleotide sequence encodes a 
polypeptide functionally equivalent to a polypeptide comprising the selected amino acid sequence; 

(d) a polynucleotide hybridizing under stringent conditions to a polynucleotide comprising the nucleotide sequence 
of any one of SEQ ID NOs shown in Table 1 , wherein said nucleotide sequence encodes a polypeptide functionally 

5 equivalent to a polypeptide encoded by the selected nucleotide sequence; 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a polypeptide 
encoded by the polynucleotide according to any one of (a) to (d); 

(f) a polynucleotide comprising a nucleotide sequence having at least 70% identity to the nucleotide sequence of 
(a); and 

w (g) a polynucleotide comprising a nucleotide sequence having at least 90% identity to the nucleotide sequence of 

(a). 

[0017] The present invention also relates to a polypeptide encoded by the above-mentioned polynucleotide or a 
partial peptide thereof, an antibody binding to the polypeptide or the peptide, and a method for immunologically assaying 
15 the polypeptide or the peptide, which comprises the steps of contacting the polypeptide or the peptide with the antibody, 
and observing the binding between the two. 

[0018] Furthermore, the present invention features a vector comprising the above-mentioned polynucleotide, a trans- 
formant carrying the polynucleotide or the vector, a transformant carrying the polynucleotide or the vector in an ex- 
pressible manner, and a method for producing the polypeptide or the peptide, which comprises the steps of culturing 

20 the transformant and recovering an expression product. 

[0019] Another feature of the present invention is an oligonucleotide comprising at least 15 nucleotides, said oligo- 
nucleotide comprising a nucleotide sequence complementary to the nucleotide sequence of any one of SEQ ID NOs: 
1 to 1970 or to a complementary strand thereof. This oligonucleotide can be used as a primer for synthesizing the 
above-mentioned polynucleotide or used as a probe for detecting the polynucleotide. The present invention includes 

25 an antisense polynucleotide against the polynucleotide or a part thereof, and a method for detecting the polynucleotide, 
which comprises the following steps of: 

a) incubating a target polynucleotide with the oligonucleotide under hybridizable conditions, and 

b) detecting hybridization of the target polynucleotide with the oligonucleotide. 

30 

[0020] Still another feature of the present invention is a database of polynucleotides and/or polypeptides, said da- 
tabase comprising information on at least one of the nucleotide sequences of SEQ ID NOs: 1 to 1970 and/or on at 
least one of the amino acid sequences of SEQ ID NOs: 1971 to 3940. 

[0021] Herein, "polynucleotide" is defined as a molecule, such as DNA and RNA, in which multiple nucleotides are 

35 polymerized. There are no limitations on the number of the polymerized nucleotides. In case that the polymer contains 
relatively low number of nucleotides, it is also described as an "oligonucleotide", which is included in the "polynucleotide" 
of the present invention. The polynucleotide or the oligonucleotide of the present invention can be a natural or chemically 
synthesized product. Alternatively, it can be synthesized using a template polynucleotide by an enzymatic reaction 
such as PCR. Furthermore, the polynucleotide of the present invention may be modified chemically. Moreover, not only 

40 a single-strand polynucleotide but also a double-strand polynucleotide is included in the present invention. In this 
specification, especially in claims, when the polynucleotide is described merely as "polynucleotide", it means not only 
a single-strand polynucleotide but also a double-strand polynucleotide. When it means double-strand polynucleotide, 
the nucleotide sequence of only one chain is indicated. However, based on the nucleotide sequence of a sense chain, 
the nucleotide sequence of the complementary strand thereof is essentially determined. 

45 [0022] As used herein, an "isolated polynucleotide" is a polynucleotide the structure of which is not identical to that 
of any naturally occurring polynucleotide or to that of any fragment of a naturally occurring genomic polynucleotide 
spanning more than three separate genes. The term therefore includes, for example, (a) a DNA which has the sequence 
of part of a naturally occurring genomic DNA molecule in the genome of the organism in which it naturally occurs; (b) 
a polynucleotide incorporated into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner such 

50 that the resulting molecule is not identical to any naturally occurring vector or genomic DNA; (c) a separate molecule 
such as a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction (PCR), or a restriction 
fragment; and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion 
polypeptide. Specifically excluded from this definition are polynucleotides of DNA molecules present in mixtures of 
different (i) DNA molecules, (ii) transfected cells, or (iii) cell clones; e.g., as these occur in a DNA library such as a 

55 cDNA or genomic DNA library. 

[0023] The term "substantially pure" as used herein in reference to a given protein or polypeptide means that the 
protein or polypeptide is substantially free from other biological macromolecules. For example, the substantially pure 
protein or polypeptide is at least 75%, 80%, 85%, 95%, or 99% pure by dry weight. Purity can be measured by any 
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appropriate standard method known in the art, for example, by coiumn chromatography, polyacrylamide gel electro- 
phoresis., or HPLC analysis. 

[0024] All the cDNAs provided by the present invention are full-length cDNAs. The "full-length cDNA" herein means 
that the cDNA contains the ATG codon, which is the start point of translation therein. The untranslated regions upstream 
5 and downstream of the protein-coding region, both of which are naturally contained in natural mRNAs, are not indis- 
pensable. It is preferable that the full-length cDNAs of the present invention contain the stop codon. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 [0025] 

Figure 1 shows the restriction map of the vector pME18SFL3. 
DETAILED DESCRIPTION OF THE INVENTION 

15 

[0026] All the clones (1 970 clones) of the present invention are novel and encode the full-length polypeptides. Further, 
all the clones are cDNAs with the high fullness ratio, which were obtained by oligo-capping method, and also clones 
which are not identical to any of known human mRNAs (namely, novel clones) selected by searching, for the 5'-end 
sequences, mRNA sequences with the annotation of "complete cds" in the GenBank and UniGene databases by using 
20 the BLAST homology search [S. F. Aftschul, W. Gish, W. Miller, E. W. Myers & D. J. Lipman, J. Mol. Biol., 215:403-410 
(1990); W. Gish & D. J. States, Nature Genet., 3: 266-272 (1993)]; they are also clones that were assumed to have 
higher fullness ratio among the members in the cluster formed by assembling. Most of the clones assessed to have 
high fullness ratio in the cluster had the nucleotide sequences longer in the 5'-end direction. 

[0027] All the full-length cDNAs of the present invention can be synthesized by a method such as PCR (Current 
25 protocols in Molecular Biology edit. Ausubel et al. (1987) Publish. John Wiley & Sons Section 6.1-6.4) using primer 
sets designed based on the 5'-end and 3'-end sequences or using primer sets of primers designed based on the 5'- 
end sequences and a primer of oligo dT sequence corresponding to poly A sequence. Table 1 contains the clone names 
of full-length cDNA of 1970 clones of the present invention, SEQ ID NOs of the full-length nucleotide sequences, CDS 
portions deduced from the full-length nucleotide sequences, and SEQ ID NOs of the translated amino acids. The 
30 positions of CDS are shown according to the rule of "DDBJ/EMBL/GenBank Feature Table Definition" (http://www.ncbi. 
nlm.nih.gov/collab/FT/index.html). The start position number corresponds to the first letter of "ATG" that is the nucle- 
otide triplet encoding methionine; the termination position number corresponds to the third letter of the stop codon. 
These are indicated being flanked with the mark However, with respect to the clones having no stop codon, the 
termination position is indicated by the mark ">" according to the above rule. 

35 



Table 1 





Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 






sequence 




sequence 


40 


ADRGL20020290 


1 


62. .1669 


1971 




ADRGL20021910 


2 


150. .707 


1972 




ADRGL20022600 


3 


464.. 81 4 


1973 




ADRGL20023920 


4 


375.. 1853 


1974 




ADRGL20026790 


5 


11 3. .2371 


1975 


45 


ADRGL20027530 


6 


201 3. .2372 


1976 




ADRGL20036380 


7 


1129.. 1557 


1977 




ADRGL20036840 


8 


260. .889 


1978 




ADRGL20040310 


9 


539.. 844 


1979 


50 


ADRGL20040770 


10 


817.. 1227 


1980 


ADRGL20046760 


11 


1108.. 1467 


1981 




ADRGL20047080 


12 


823..1134 


1982 




ADRGL20047770 


13 


1532.. 1897 


1983 




ADRGL20057560 


14 


376.. 846 


1984 


55 


ADRGL20059610 


15 


969.. 1961 


1985 




ADRGL20062330 


16 


799..>2117 


1986 




ADRGL20063770 


17 


344.. 664 


1987 
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Table 1 (continued) 



5 



w 



15 



20 



25 



Glone name 


SEQ ID NO of nucleotide 


Position of CDS 


SEQ ID NO of amino acid 

K^s l_ II—/ Ii w. vl C« 1 1 III 1 w QUIU 




sequence 




sequence 


ADRGL20066770 


18 


22. .1416 


1 988 


ADRGL20067320 


19 


276..1016 


1 989 


ADRGL20079060 


20 


63.. 1748 


1 990 


ADRGL20095330 


21 


929 1516 


1991 


ASTR020001 910 


22 


1753 >2216 


1 992 


ASTR020003720 


23 


2089 2586 


1 993 


ASTR020004820 


24 


444.. 1040 


1 994 




25 


2.. 11 23 


I C7CIO 


/V^-> 1 I lUtvUv^ 1 *+v/ 


26 


344. .1 71 4 


1 996 


A^TRn^nm nm n 

no i i lwcuu i v>vs I vf 


27 


1 236 1 727 


1 997 




28 


2.. 745 


1 998 


A^TRHOOm 997n 


9Q 


286 627 


1 QQQ 

i yyy 


a <^TRn9nn9n.94n 




1 0 aaR 
I .ooo 


9nnn 


a qtr 09nn9nQRn 

/AO 1 nU^UU^UOJU 


a1 


1 004,. 1 OOH 


9nm 


a c >TRn9nn99n9n 

/AO 1 nU^UU££U£U 


ft9 


4R7 1 OQ^ 
fO / . . I L/S70 




/AO 1 nUtUUtOOty 




47*3 91 R1 
«+/ O. I O I 


9nn^ 


a cn~Rn9nn97ftftn 


a4 


4ft1 11H1 

*+0 I . . 1 1 1 




/AO 1 nUtuU004Uy 


oo 


R9 909R 


9nnR 


A^TRnonnzi^R^in 

MO i nu^L/w £ too t nj 


*}R 


^DO. . I (JO I 


ofinR 

cUUO 


A^TRn9nn4R9fin 


o / 


1 4n 1 4n9 


9007 


a ^TRn9nn47R 1 n 


oo 


I9dn 1R11 

1 t'tU.. toil 




mo i nUiiuujuo i \j 


3Q 


171 1 RQ4 

1 / 1 . . 1 Oi7*t 


9nno 


a c ?TRn9nn c i949n 


4n 

*+v/ 


11RR 9^R9 


9m n 


a ^ t r n p nn ^ 

mo i nUiiuuj jh ou 


41 


91ft 1 ftftR 
c IO,, I ooo 


9H1 1 


a < ^TRn9nnRR^ / *n. 


49 


cUa. .ooy 


9H1 9 


a ^TRnonn^R^yn 

MO 1 nUiiUUOjO / \J 


4*3 
*+o 


949 7^^ 


901 ^ 


mo i nu^uuooyou 


44 

*4*+ 


QytQ 1 ORR 
OHO. . I UOD 


9m 4 


mo i nutuuooaou 


4R 
*+o 


cc 101R 
OO. . Ic IO 


£.yj i o 


a QTRnonnRQonn 
mo i nutuuoy/iuu 


4R 


71 

/ 1 . . 1 ^O/ 


QH1 R 
^Ul o 


mo i nu^uu / o i ou 


47 


1 nn4 1 7QR 
t uu*f .. i /yo 


QH1 7 


mo i riu^uu/ooou 


4ft 
*+o 


1 CQA 1 QRft 

i oy*k. l yoo 


Q01 ft 


MO 1 nu^UUuOUOU 


4Q 


A~7CS 0011 
4/U. ,cUl I 


on 1 q 
tui y 


mo i nu^uuooaou 


ou 


0*1-0. . 1 oou 


often 


MO 1 nUtUUObOUu 


O 1 


1 ao 11 ot; 

I ..11 <10 


onoi 


a Q~rRn9nnQn.Rftn 
Mo i nucuubuoou 


CO 


\ ,.><COO 1 




Ao 1 riU^UUy 1 1 oU 


CO 

OO 


i O 104/1 




ASTR020091770 


54 


10..318 


2024 


ASTR020141740 


55 


30.. 347 


2025 


BGG1 120000670 


56 


240.. 61 4 


2026 


BGGI1 2001 0750 


57 


72..>2507 


2027 


BNGH41 0000570 


58 


957.. 2027 


2028 


BNGH420008150 


59 


596. .1606 


2029 


BNGH420014060 


60 


1072. .1413 


2030 


BNGH42001 5760 


61 


92. .1336 


2031 


BNGH420021680 


62 


147. 2093 


2032 


BNGH420023870 


63 


301. .1851 


2033 


BNGH420024870 


64 


155.. 1960 


2034 


BNGH420035290 


65 


114.2126 


2035 


BNGH420036410 


66 


1778..2143 


2036 
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Table 1 (continued) 



5 



w 



15 



20 



25 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


BNGH420040760 


67 


698.. 11 62 


2037 


BNGH420042910 


68 


191. .721 


2038 


BNGH420045380 


69 


721. .1254 


2039 


BNGH420046790 


70 


1079.. 1429 


2040 


BNGH420052350 


71 


787..1146 


2041 


BNGH420059680 


72 


213..2393 


2042 


BNGH420061350 


73 


600..3131 


2043 


BNGH420062340 


74 


294.. 641 


2044 


BNGH420070370 


75 


567.. 2429 


2045 


BNGH420074600 


76 


125. 1696 


2046 


BNGH420075940 


77 


31. .510 


2047 


BNGH42007798 


78 


288..2147 


2048 


BNGH420085100 


79 


243. .569 


2049 


BNGH420086030 


80 


107..>2556 


2050 


BNGH420087430 


81 


195.. 1835 


2051 


BRACE1 0000510 


82 


642. 1703 


2052 


BRACE20003310 


83 


1379.. 2893 


2053 


BRACE20007330 


84 


339. .2015 


2054 


BRACE20009050 


85 


1023.. 1493 


2055 


BRACE20014450 


86 


161. .1096 


2056 


BRACE20017790 


87 


304.. 639 


2057 


BRACE200 18810 


88 


397.. 11 46 


2058 


BRACE20025820 


89 


453.. 767 


2059 


BRACE20038920 


90 


1122. .1463 


2060 


BRACE20050870 


91 


110. .1987 


2061 


BRACE20051600 


92 


443. .895 


2062 


BRACE20051930 


93 


173. .901 


2063 


BRACE20052430 


94 


825.. 1637 


2064 


BRACE20052530 


95 


98. .490 


2065 


BRACE20054080 


96 


762.. 1382 


2066 


BRACE20054480 


97 


111 ..902 


2067 


BRACE20054600 


98 


392.. 1276 


2068 


BRACE20055560 


99 


136.. 735 


2069 


BRACE20057870 


100 


11 69. .1825 


2070 


BRACE200591 1 0 


101 


1452. .1910 


2071 


BRACE2005981 0 


102 


689. .221 8 


2072 


BRACE20061620 


103 


162.. 1163 


2073 


BRACE20062580 


104 


1164. .1859 


2074 


BRACE20063540 


105 


378.. 1670 


2075 


BRACE20065470 


106 


427..1101 


2076 


BRACE20066360 


107 


233.. 736 


2077 


BRACE20068710 


108 


1099..1440 


2078 


BRACE20069000 


109 


1355.. 2305 


2079 


BRACE20069110 


110 


576.. 91 7 


2080 


BRACE20069440 


111 


278.. 1504 


2081 


BRACE20079200 


112 


| 928.. 1413 


2082 


BRACE20079370 


113 


158.. 1522 


2083 


BRACE20097540 


114 


1474.. 21 03 


2084 


BRACE20098860 


115 


693.. 11 93 


2085 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 


* 


sequence 


BRACE20099070 


116 


53.. 1441 


2086 


BRACE201 94670 


117 


11. .616 


2087 


BRACE20196180 


118 


35.. 91 6 


2088 


BRACE201 96960 


119 


1454.. 1912 


2089 


BRACE20200770 


120 


306.. 683 


2090 


BRACE20200970 


121 


426.. 764 


2091 


B RAC E20204670 


122 


760.. 2 124 


2092 


BRACE20205840 


123 


40.. 387 


2093 


BRACE20207420 


124 


11 9.. 469 


2094 


B RAC E202 1 2450 


125 


168. .590 


2095 


BRACE20215410 


126 


111 ..1361 


2096 


BRACE2021 6700 


127 


1 403 1 738 

1 *TVO ..II \J\J 


2097 


BRACE2021 6950 


128 


911 1315 


2098 


BRACE2021 9360 


129 


198.. 596 


2099 


BRAMY1 0000980 


130 


254..616 


21 00 


BRAMY1 0001 730 


131 


796 1 1 58 


21 01 


BRAMY2000021 0 


132 


134.. 445 


21 02 


BRAMY20000250 


133 


1 90 1 932 


21 03 


BRAMY2000151 0 


134 


129. .91 7 


21 04 


BRAMY20003540 


135 


144.. 2477 


21 05 


BRAMY20003880 


136 


191 808 


21 06 


B R AM Y20 005080 


137 


1 638 1 958 


21 07 


BRAMY2001 3670 


138 


551 2881 


21 08 


BRAMY2001 6780 


139 


273 1 985 


21 09 


B RAM Y20020440 


140 


359 685 


2110 


BRAMY20021 580 


141 


67.. 555 


211 1 


BRAMY20023390 


142 


1 568 1 939 


2112 


BRAMY20023640 


143 


1684 2280 

1 UUt. ,f r QU 


2113 


RRAMY20024790 


144 


276.. 626 


2114 


BRAMY20027390 


145 


420.. 782 


2115 


BRAMY20027990 


146 


529 1 572 


2116 


BRAMY20028530 


147 


973 1 278 


211 7 


BRAMY20028620 


148 


1 048 1 434 


211 ft 


RRAMY20035380 


149 


92*1 1 707 


CM 1 Q 


BRAMY20035830 


150 


21 9. .977 


51 9fl 


RRAMYPnmfiR^n 

Dri/AiVl I cvUOOSOU 


1 ra 


1411 17R1 

it i i .. i / □ i 


OA OA 


BRAMY20036810 


152 


321.. 644 


2122 


BRAMY20038980 


153 


715..>2057 


2123 


BRAMY20039290 


154 


81. .1043 


2124 


BRAMY20040580 


155 


374.. 769 


2125 


BRAMY20043520 


156 


942.. 1778 


2126 


BRAMY20043630 


157 


25. .1119 


2127 


BRAMY20044920 


158 


40.. 1947 


2128 


BRAMY20045210 


159 


367.. 750 


2129 


BRAMY20045420 


160 


4.. 888 


2130 


BRAMY20047560 


161 


220. 726 


2131 


BRAMY20050640 


162 


1802.. 2635 


2132 


BRAMY20050940 


163 


23. .385 


2133 


BRAMY20051820 


164 


1411. .2157 


2134 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


BRAMY20052440 


165 


29. .448 


2135 


BRAMY20053910 


166 


11. .358 


2136 


BRAMY20055760 


167 


664.. 2253 


2137 


BRAMY20056620 


168 


46.. 726 


2138 


BRAMY20056840 


169 


40.. 1392 


2139 


BRAMY20063750 


170 


535. .21 66 


2140 


BRAMY20072440 


171 


823.. 1620 


2141 


BRAMY20072870 


172 


437.. 910 


2142 


BRAMY20073080 


173 


3.. 353 


2143 


BRAMY20074110 


174 


31 4. .838 


2144 


B RAM Y20074860 


175 


11 35. .1524 


2145 


BRAMY200761 00 


176 


459. .101 9 


2146 


BRAMY200761 30 


177 


59.. 376 


2147 


BRAMY20076530 


178 


1010. .1465 


2148 


BRAMY20083330 


179 


10.. 492 


2149 


BRAMY20083820 


1 80 


625 1113 

\J£—^J ..III 


2150 


BRAMY20089770 


181 


1 73 955 


2151 


BRAMY20091 230 


182 


818. .1579 


2152 


BRAMY20093490 


183 


637.. 1080 


2153 


BRAMY20094890 


184 


30. .2138 


2154 


BRAMY20095080 


185 


239. .613 


2155 


B RAM Y20095570 


186 


109 807 


2156 


BRAMY20096930 


187 


1381 ..1800 


2157 


B RAMY20 1 00680 


188 


268. .870 


2158 


BRAMY201 02900 


189 


200 760 


2159 


BRAMY201 07980 


190 


343.. 669 


21 60 


BRAMY20111780 


191 


584.. 2209 


2161 


BRAMY201 1 7670 


192 


63.. 782 


21 62 


BRAMY201 18410 


193 


24..782 


21 63 


BRAMY201 18490 


194 


39. .791 


21 64 


BRAMY20120170 


195 


1130 1459 


21 65 


BRAMY201 23400 


1 96 


1 420 1 755 

1 *-tGJ\J ..It JO 


21 66 


BRAMY20 124970 


1 97 


825 1 226 


21 67 


R RAM Y20 1 25 1 70 


1 98 


157. .579 


21 68 


BRAMY201 25360 

U I l#\IVI 1 £—\J I L.JwUv 


1 99 




21 69 


RRAMY201 25550 


200 


29.. 1 747 


21 70 


BRAMY201 26910 


201 


114..518 


2171 


BRAMY201 27310 


202 


1986.. 2336 


2172 


BRAMY201 27760 


203 


31 7.. 691 


2173 


BRAMY201 34050 


204 


199. 522 


2174 


BRAMY201 35720 


205 


57.. 401 


2175 


BRAMY201 37360 


206 


976. .21 93 


2176 


BRAMY201 39440 


207 


2.. 1597 


2177 


BRAMY201 39750 


208 


88.. 435 


2178 


BRAMY201 43870 


209 


1419..2102 


2179 


BRAMY201 52510 


210 


296.. 1993 


2180 


BRAMY201 55500 


211 


722.. 1069 


2181 


BRAMY20 158550 


212 


142.. 951 


2182 


BRAMY201 59250 


213 


286.. 810 


2183 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


BRAMY201 60020 


214 


143. .919 


2184 


BRAMY201 73480 


215 


1544. .1906 


2185 


BRAMY201 90550 


216 


114.. 1640 


2186 


BRAMY201 94680 


217 


1179. .1517 


2187 


BRAMY20204270 


218 


21 8. .844 


2188 


BRAMY20206340 


219 


237. 1805 


21 89 


BRAMY20219620 


220 


1014. .1670 


2190 


BRAMY20221600 


221 


168. 992 


2191 


BRAMY20223010 


222 


466.. 1257 


2192 


BRAMY20225250 


223 


368.. 673 


21 93 


BRAMY20225320 


224 


241 ..840 


21 94 


BRAMY20227230 


225 


1327. .2067 


21 95 


BRAMY20227860 


226 


10. .657 


21 96 


BRAMY20227960 


227 


1148. .1558 


21 97 


BRAMY20231150 


228 


199. .1146 


21 98 


BRAMY20234820 


229 


199.. 2079 


21 99 


BRAMY202371 90 


230 


397 870 


2200 


BRAMY20238630 


231 


303.. 1331 


2201 


BRAMY202431 20 


232 


1756.. 2451 


2202 


BRAMY20244490 


233 


257. .982 


2203 


BRAMY202451 40 


234 


3. .1295 


2204 


BRAMY20245350 


235 


94. .750 


2205 


BRAMY20245760 


236 


35 1375 


2206 


BRAMY20251 21 0 


237 




2207 


BRAMY20251 750 


238 


97. .846 


2208 


BRAMY20263000 


239 


216 1553 


2209 


BRAMY20267780 


240 


1194 1706 


221 0 


BRAMY20269040 


241 


989 2071 


221 1 


BRAMY20271140 


242 


1582 2238 


221 2 


BRAMY2027451 0 


243 


1 / OJ. . t_ 1 oo 


PP1 ^ 

I O 


BRAMY20285650 


244 


23.. 382 


C.C. 1 *+ 


BRAMY20287400 


245 


1 ..456 


1 Q 


BRAWH20014590 


246 


125.. 856 


221 6 


RRAWH20090470 


247 


IO 1 . . -> (L- \ OU 


jC^I I / 


BRAWH20020600 


248 


402.. 722 


221 8 


BRAWH20021 Q1 0 


249 




oo\ Q 


BRAWH20025490 


250 


1699..2106 


2220 


BRAWH20026010 


251 


307.. 2034 


2221 


BRAWH20027250 


252 


942.. 1499 


2222 


BRAWH20030000 


253 


381. .1286 


2223 


BRAWH20039640 


254 


109..>2281 


2224 


BRAWH20040680 


255 


201.. 2291 


2225 


BRAWH20047790 


256 


290.. 631 


2226 


BRAWH20050740 


257 


512..>1907 


2227 


BRAWH20055240 


258 


1339.. 1653 


2228 


BRAWH20055330 


259 


1507.. 1911 


2229 


BRAWH20055780 


260 


354.. 953 


2230 


BRAWH20058120 


261 


896.. 1501 


2231 


BRAWH20063010 


262 


1839.. 2579 


2232 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 
sequence 


Position of CDS 


SEQ ID NO. of amino acid 
sequence 


BRAWH20078080 


263 


161. .745 


2233 


BRAWH20078620 


264 


99.. 470 


2234 


BRAWH20080580 


265 


35.. 1000 


2235 


BRAWH20082550 


266 


2191. .2670 


2236 


BRAWH20082920 


267 


1579..2124 


2237 


BRAWH20093040 


268 


1202. 1855 


2238 


BRAWH20093070 


269 


469. 1569 


2239 


BRAWH20094900 


270 


712. .2166 


2240 


BRAWH20095900 


271 


247. .21 78 


2241 


BRAWH201 73790 


272 


634.. 1428 


2242 


BRAWH201 74330 


273 


2532.. 3272 


2243 


BRAWH201 75230 


274 


1119. .1451 


2244 


BRAWH201 75340 


275 


88.. 504 


2245 


BRAWH201 76850 


276 


32.. 2203 


2246 


BRAWH201 82670 


277 


2751 ..3059 


2247 


BRAWH201 831 70 


278 


106. .909 


2248 


BRAWH201 85260 

I—* 1 ir\i Ti l C— w i uuluv 


279 


204 1946 


2249 


BRAWH201 85270 

I— ' 1 \i \ f f 1 1 £— \J 1 Www I V 


280 


15.. 869 


2250 


BRAWH201 8601 0 


281 


886. .1389 


2251 


BRAWH20 1 88750 


282 


21 .824 


2252 


BRAWH201 90530 


283 


410.. 1024 


2253 


BRAWH201 90550 


284 


55. .1533 


2254 


BRAWH201 91 980 


285 


1426. .21 72 


2255 


BRCAN 1 0000760 


286 


548 1885 

W^TWi > 1 W>WW 


2256 


BRCAN 1 0001 050 


287 


388 828 

www » . w w w 


2257 


BRCAN 10001 680 

L-J 1 lv/ni il Iwww 1 www 


288 


519 998 

W 1 w . ■ w w w 


2258 


BRCAN20001480 


289 


11 4. .449 


2259 


BRCAN200041 80 

l_J 1 1 N C- W w W" 1 WW 


290 


8. .331 


2260 


BRCAN20005230 


291 


63 590 

WW . .Uv7U 


2261 


BRCAN2000541 0 


292 


52 1335 


2262 


BRCOC1 0000400 

Ul 1 w W W W^ W W 


293 


11 ..3664 


2263 


BRCOC20000470 

1 — * \ IW WW W W W W W*T / V 


294 


283 1851 


2264 


BRCOC20003600 

Ul IW w £- W \J Ww U w W 


295 


1 63 1 455 


2265 ! 


RRHIP1 0000720 


296 


4. .31 2 


2266 


RRHIP1 0001 040 

uni iii i uuu i u^v 


297 


76. .131 7 


2267 


RRHIP2000021 0 


298 


7 373 




BRHIP20003590 


299 


618.. 1238 


2269 


BRHIP20005060 


300 


756.. 11 78 


2270 


BRSSN20001970 


301 


89.. 526 


2271 


BRSSN20005610 


302 


174..>2375 


2272 


BRSSN20005660 


303 


2089..>2535 


2273 


BRSSN20066440 


304 


248.. 1474 


2274 


BRSSN20074640 


305 


257.. 985 


2275 


BRSSN20091190 


306 


1214..2002 


2276 


BRSSN20092440 


307 


4..396 


2277 


BRSSN20093890 


308 


94. .717 


2278 


CD34C20001 750 


309 


10.. 840 


2279 


CTONG1 0000090 


310 


2551. .2991 


2280 


CTONG20000340 


311 


856.. 1929 


2281 
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Table 1 (continued) 



5 



w 



15 



20 



25 



whjiic i lai I ic 


O LW 1 1— / IM VJ. U 1 1 IUL.IUUUUC 


Pncitinn nf 


OCfi | Pi MO nf aminn anlH 




sequence 




sequence 


CTON G20002790 


312 


373 708 

O / O . . / uU 


2282 


CTONG20004110 


313 


74.. 3079 


2283 


CTONG20004520 


314 


1 46. .772 


OP 84 




O I o 


1 QO 1 A/1ft 


OOOC 


CTONGP00081 Q0 


316 


1313 178Q 




CT0NGP00084 


31 7 


1 U04. . 1 Of?? 


OPR7 


CTONGP001 *>P40 


31 8 


1 ^d. 87/1 


OOOD 


OTONIf^Onni 7AQf) 

O 1 V_/|>JVJt^UU 1 / t +v?U 


^1 Q 

O I i7 




OOQQ 


PTONGPOOPOfifin 


^on 


c iy. .000 


OOQH 






OA qOR? 


ooqi 


p"TO m f^o n n o 7R r n 




1 uo..y^4c 


OOQO 


pto m oo n n O QCl *\C\ 




1 HO 04QO 


OOQQ 


V-/ 1 v_^in vji<iuuou*iou 




OIQ OPQ1 


OOQA 




o^o 


OH7C OCylR 


2<^yo 


o i vjinvj^uuo i oyu 




O.. 1 /uo 


2<iyo 


o i uin ucuuo^you 


0.07 




OOQ7 


OTOKioonno.Qcnn 


Q.OP. 


1 000. 1 0 


OOQQ 


r^~r^iM monnoQ£H n 


oziy 




2299 


O 1 WIMvj^UUoo/ OU 


oou 


Q*^ i QQQ 

yo.. 1 yyy 


<ioUU 




OO 1 


OQ/l QRQ7 


OQQi 
coUl 






/1QO 77n 


oq no 
2oU2 


nxn m r^o n n o. rqq n 




10D0..£04/ 


2oUo 


o i uivu^uuoyo / u 






2JU4 


O 1 WINV^I^UU*+ 1 1 OU 


OOO 




none 




000 


A~7~7 007ft 


none 
2oUb 


OTnivioonnAOfi/m 

O 1 UlNHj^lUU^f^O^tU 




b..^ool 


2oU/ 


O 1 V_JIN\j*LUU*+*f^OU 


000 


1 1 00 Qonc: 


2oUo 






• 1 7 OCC/I 

l / ..*ioo^ 


oono 


o i ursivj^uu^i-oouu 


OA c\ 


00c K ncn 

^b..i yoU 


231 0 


PTAMCOfinyf conn 




oo0..o4U 


OO -4 H 

2311 


O 1 UlMVa^UU'l-y^oU 


o42 


tl b..>J<ibo 


2312 


O 1 vJINLa^UUoU^-yU 


Q/1 Q 

o4o 


4^0. .ooy 


2313 


o i uiNij^uuon uu 


o44 


4B..4 I y 


OO H A 

2314 


rTHMPonnn H /inn 


o4o 


1 239.. 1 823 


2315 


PTOMPonr\co7on 
O 1 UlNta2UUo2/oU 


o4b 


9/6.. 1 OOO 


2316 


o 1 vJNCj^ooooyyo 


347 


0000 ^ once 

2233..>2955 


2317 


CTONG20055670 


348 


2306.. 2620 


2318 


CTONG20055850 


349 


506.. 1246 


2319 


CTONG20056150 


350 


95. .1150 


2320 


CTONG20057750 


351 


2355..2849 


2321 


CTONG20057950 


352 


3061. .3420 


2322 


CTONG20059130 


353 


109.. 2613 


2323 


CTONG20060040 


354 


204.. 2630 


2324 


CTONG20061290 


355 


226. .819 


2325 


CTONG20062730 


356 


307.. 687 


2326 


CTONG20063770 


357 


108.. 3203 


2327 


CTONG20063930 


358 


250..2700 


2328 ; 


CTONG20065240 


359 


1983. .2333 


2329 


CTONG20065680 


360 


2369.. 2797 


2330 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 

WlU 1 I C 1 lull 1CJ 


SEO ID NO of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


CTONG200661 1 0 


361 


122.. 1972 


2331 


CTONG20068360 


362 


1136. .1876 


2332 


CTONG20069320 


363 


843. 1238 


2333 


CTONG20069420 


364 


130 600 


2334 


CTONG20070090 


365 


561 2960 


2335 


CTONG2007072 


366 


431 ..2677 


2336 


CT0NG20Q70780 


367 


2.. 2896 


2337 


CTONGP0070Q1 0 

\s 1 V— ' 1 N V_3t.UU / UCT 1 w 


368 


87.. 1397 


2338 


CTONG20071 040 


369 


16. .1551 


2339 


CTONGP0071680 


370 


189 >2419 


2340 


pto m PiP n n 7pq ^n 

O 1 UlNdtUU/ ^CTOU 


O/ 1 


iqq PR43 


2341 


PTONP,pnn7'3QQn 


"}7P 


749 2428 


2342 


PTONPiPnn74nnn 


^7^ 
0/0 


R1 ^IRR 

0 1 . .0 1 OO 




PTONIGP00741 70 


374 


153.. 1211 


2344 


PTONGP0074740 

O 1 1 N \*r UU 1 *+ / *+U 


375 


2754 >30RS 


2345 


rjn m p 0 n n 7 r p ^ n 


0/ 0 


piq? prro 


P^4R 


PTONP»Pn07RR1 0 


377 


QOQ P40P 

\7U3,, t t +U^ 


2347 


PTONPP0n777R0 


378 


1R17 P170 


2348 


PTONPPn07R'34n 


379 


mn pppq 


2349 


PTONPPnn7QRQn 


^Rn 


1 4Q 1 Ofifi 

1 *+CT . . 1 UOO 


2350 


PTONPPnnRni4n 


^R1 

OO 1 


1 RQ RRfi 

1 OCT. .OOO 


P^Rl 

tOJ 1 


PTONPPnnRiR4n 




CRfi OQ7 
OOD. .057/ 


P^ C P 


P TOM PPO 0^4.^0 


^Rl 
000 


1 OCT. . 1 OdJ 


2353 




384 


10fi 1440 
1 UD. . t *+*+u 


2354 


PTOMG200R4020 


385 


OU I . .ODD 


2355 


PTONIPPOnR4RRO 


386 


21 7.. 753 


2356 


rTnMn?nnR^9i n 

V>» I UlNO&UUOOd 1 VJ 


387 


7RQ 1 R7R 

/ OCT. .lO/O 


2357 


PTONOP01 T37PO 


^RR 

000 


1 ^n 


P9RR 


PTONPP01 RRRQ0 


^RQ 
ooct 


17P1 P044 




1 UIMU^lU 1 DO /OU 


oyu 


one 1QCC 
c ID,, I CTOO 


cODU 


O 1 \Jv\\Jk£\j 1 oooou 


Oct I 


O^U. . ID 1^ 


^?*^R"1 
tOD 1 


PTONPP01 R77RO 
O 1 kJ\h\J^lU id// ou 


^QP 


■ICQ Afii 

1 Oct. .*+o 1 


P^RP 


PTOM^ni RRP40 


090 


4Q1 ft*>fi 

*tCT I . .OOD 


P^R^ 


O 1 VJINVJ^U I DO^fOU 


OCT*+ 


ODct. .0 / I 


O^RZL 


PTO N PO0 1 £Qfi/lfi 


OctD 


I OCT. .OCTH- 




O 1 vjin^hiu 1 dctdou 


octo 


1 U/D.. 1 OCTCT 


£ODD 


CTONG201 70940 


397 


184.. 1569 


2367 


CTONG201 74290 


398 


96.. 1826 


2368 


CTONG201 74440 


399 


1246.. 1623 


2369 


CTONG201 74580 


400 


1..1023 


2370 


CTONG201 76040 


401 


147. 737 


2371 


CTONG201 79390 


402 


1423.. 1881 


2372 


CTONG20 179890 


403 


335.. 2344 


2373 


CTONG201 79980 


404 


1821. .2210 


2374 


CTONG201 80620 


405 


1556.. 1915 


2375 


CTONG201 80690 


406 


366.. 1442 


2376 


CTONG20181350 


407 


1167.. 1607 


2377 


CTONG201 83430 


408 


31 9.. 2706 


2378 


CTONG201 83830 


409 


393.. 2687 


2379 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEO ID NO of amino acid 

^ W 1 1—* l^^^« \Jt ul l Ml lw uwlu 




sequence 




sequence 


CTONG20184130 


410 


1970.. 2536 


2380 


CTONG201 84830 


411 


323. 1204 


2381 


CTONG20186140 


412 


21 ..443 


2382 


CTONG201 86290 


413 


21 70. .2811 


2383 


CTONG201 86370 


414 


156,. 1244 


2384 


CTONG201 86520 


415 


211 ..2136 


2385 


CTONG201 86550 


416 


260. .811 


2386 


CTONG201 88080 


417 


207. .2774 


2387 


CTONG201 89000 


418 


1 74. 1 682 


2388 


CTONG201 90290 


419 


220 2205 


2389 


CTONG201 90630 


420 


34 2082 


2390 


DFNFS2001 6470 


421 


288 851 


2391 


DFNF552001 8000 


422 


40 15QR 


2392 




423 


7RR 1 1 34 

/ Ou. . I 1 OH 


2393 


DFIMFS200281 70 


424 


pnq i 734 


2394 


r> piu p Q?00?QfifiO 


425 


356 9443 

03D. . iCH-H-O 


93Q5 






35.. 931 


93QR 


DFIMF^PO 04371 0 


427 




93Q7 


n f w f ^ ?n n4 r R4o 


428 


7RR 1 31 0 
/ oo. . IO IU 


939R 


D rlMCOt.UU J JHUU 


429 


343 1 5R4 


93QQ 


nFNFS200S7660 

L/l I'fLUL.UUJ / UUU 


430 


25.. 795 


2400 


DFNFS200R3460 


431 


26. .406 


2401 


DFNFS?007?990 


432 


62.. 11 56 


2402 


DFNFS200733PO 


433 


377.. 1 735 


2403 


nFNF c 5?007fi340 


434 


1R7 RQR 


94H4 


DFNES20080880 


435 


11 ..1669 


2405 


DFNFS2008881 0 

L/l IMLO^UUDUO 1 U 


436 


1 71 ..548 


2406 


nFNF c i?nnQ4fl9n 


437 


in7 1 Rn7 


94H7 


FHRRF1 nnnn93n 

I ODDl 1 UUUUcOU 


438 


1 H4 3947 


94nR 


1 w DDI 1 Uvv^l^UU 


439 


4ftn 7R9 


940Q 


FPRRF1 nnn47RO 


440 


C7ft 1 Q7Q 

o# o. . iy/o 


941 n 


FHRRFPnm flRRn 


441 


1 77 1 794 


9411 


pf r d pp n non 44n 


449 




941 9 


FPRRFonn? 1 1 1 n 


443 


I OD..O I / 


941 *5 




444 


FiA 1 77Q 

o*+.. i / /y 




Fr^RRFPnnORQRO 


445 


RQ4 mi4 




FCBBF20029280 


446 


1680..2021 


2416 


FCBBF20032930 


447 


90. .452 


2417 


FCBBF20033360 


448 


31 5.. 2369 


2418 


FCBBF20035430 


449 


196.. 726 


2419 


FCBBF20035490 


450 


131. .1387 


2420 


FCBBF20036360 


451 


16.. 366 


2421 


FCBBF20038230 


452 


1468. .1908 


2422 


FCBBF20038950 


453 


648.. 992 


2423 


FCBBF20041380 


454 


612. .2174 


2424 


FCBBF20043730 


455 


45..>2063 


2425 


FCBBF20054390 


456 


1367.. 1756 


2426 


FCBBF20056580 


457 


82..>2394 


2427 


FCBBF20059660 


458 


672.. 1226 


2428 
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Table 1 (continued) 



5 



10 



15 



20 



25 



f^lnnf* namp 

wlUI IC 1 ICXI 1 IC 


SFQ ID NO of nurlpotlrip 


Position of CDS 


SFO ID NO of amino arid 




sequence 




sequence 


FCBBF20061310 


459 


1582 1980 


2429 


FC BBF20066340 


460 


11 6.. 1024 


2430 


FCBBF20070800 


461 


968 1 447 


2431 


FCBBF20070950 


462 


50 >2299 


2432 


FCBBF3000001 0 


463 


74.. 643 


2433 


FORRF^OOOI 020 


464 


1 6.. 348 


2434 


FORRF^OOOI 1 00 


465 


ft41 ^IPR 

Ot 1 . . tf^O 1 C—^J 


2435 


FORRF^OOOI 1 R0 

ivODrOUVU I I JU 


466 


190Q 16^1 

1 tUc< > . 1 DO 1 


2436 


FCRRF30002270 


467 


31 9. .927 


2437 


FCRRF300029R0 


468 


113 4036 


2438 


FPRRFftono9ftfto 


469 


fi R11 


2439 


roDDrouuuoD i u 


470 
*+/ u 


101R 901Q 


2440 


no DDrouuuHOHU 


471 
t / i 


190 101** 


9441 
ttt i 


fprrf^ooo47'30 


472 


^Rft 1 097 

OUO . . 1 ut / 


2442 


Ff^RRFinnoRi no 


473 


1 RR 97^4 

i JU. . t 1 Ot 


2443 


FPRRF^OOOR^RO 


474 


29 271 0 


2444 


FPRRF^fiOORRnn 


47R 

t / o 


to i . .tfyf 


944R 

ttto 


roDDrouu is i *t u 


47R 
t / O 


40 9Q7Q 


944R 
ttto 


FfRRF^nm Q1 no 


477 


140 1 RRQ 

I tU . . I DDcJ 


2447 


FPRRF^OOI 0.9 


47ft 
t / o 


1914 99RQ 


944ft 
ttto 


FPRRF^OOOI QOO 


47Q 


R^ 1Q7^ 
do., i y / o 


944Q 
ttty 


FPRRF^0099RftO 


4ftO 


1 ^Ofl 94ftO 
I OUO . . ttou 


94RO 
ttou 


FPRRF^OOPRRflO 


4ft 1 

HO I 


170 979R 
I / u. .t / 


94^1 

ttO 1 


Fr*RRF < 3009Q9R0 


4R9 


R9 40ftR 

3t . .tUOO 


94^9 
ttot 


FPRRP^nn^^^7n 


4ft^ 


91 7 >^C>4Rft 
tu, .>ttOO 


94*^ 
ttOO 


Fr > RRF'30049R1 0 


4ft4 
tot 


1 944 

OO. . I ttt 


94P»4 
ttot 


FORRF'3nn4ft49n 


4ft f> 
too 


1 44 1 DQ4 
i tt . . i uy t 


ttoo 


FPRRF^nncmnn 


4RR 
too 




94RR 
ttoo 


FrRRF^nncifiQfln 


4ft7 
to / 


1 non iair 
i uyo. . j 1 1 o 


94R7 
tto/ 


Fr k RRFftnOR94QO 


4ftft 
too 


1 Oft 1 OR^ 
1 £0. . 1 UDO 


t*fOo 




4RQ 


pen ftPft 

tOU. .OtO 


ttoy 


FPRRF^nflRftOI n 


AQCi 

tyu 


Ci P7RP 
O t . .t / Dt 


OA RO 
tt DU 


FORRF^fin71 Ron 
rODDroUU/ IOUU 


ty i 


PP7 ^PftQft 

tt / . .>toyo 


OA C51 


FrRRFQfin79^4ft 
rUDDrouu/ tt*+u 


AQO 

*+yt 


t*4-0O. .tOOo 


OAG.O 

ttot 


rvDDrOUu/iHOl) 




tOUt..>oUo/ 


oa ao 
ttbo 


fo RRF^nn7zi^n 


/4Q4 

tyt 


■f n*^i I/no 

1 UO 1 . . I tot 


OA d.A 

t^ot 


FCBBF30074620 


495 


840..1316 


2465 


FCBBF30075970 


496 


146.. 460 


2466 


FCBBF30076310 


497 


42. .1007 


2467 


FCBBF30078600 


498 


113.. 1399 


2468 


FCBBF30079770 


499 


693.. 2777 


2469 


FCBBF30080730 


500 


54.. 467 


2470 


FCBBF30081000 


501 


526.. 924 


2471 


FCBBF30085560 


502 


60.. 1919 


2472 


FCBBF30088700 


503 


39..>3015 


2473 


FCBBF30089380 


504 


8 .2701 


2474 


FCBBF30091010 


505 


172..>3465 


2475 


FCBBF30091520 


506 


56.. 2284 


2476 


FCBBF30093170 


507 


974.. 1528 


2477 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


FCBBF30095410 


508 


64.. 1002 


2478 


FCBBF30099490 


509 


2939..3256 


2479 


FCBBF301 00080 


510 


1237.. 1656 


2480 


FCBBF301 00120 


511 


8. .2305 


2481 


FCBBF301 00410 


512 


121 ..1374 


2482 


FCBBF30101240 


513 


1 696..2382 


2483 


FCBBF301 01 300 


514 


3802..>4413 


2484 


FCBBF301 05080 


515 


973 1 836 


2485 


FCBBF301 05440 


516 


1354.. 2040 


2486 


FCBBF301 05860 


517 


355 >2524 


2487 


FCBBF301 06Q50 


518 


1 92..944 


2488 


FCBBF301 07290 


519 


1002 1610 


2489 


FCRBF301 07330 


520 


703 1071 


2490 


FCBBF301 1 41 80 


521 


M77 2289 

lift ■ . L—t—\S> C7 


2491 


FCBBF301 1 4850 


522 


76Q 1 503 


2492 


FCBBF301 1 5230 


523 


41 7..7S5 


2493 


FCBBF3011 5920 


524 


543 1 802 


2494 


FCBBF301 1 8670 


525 


8Q1 2807 


2495 


FCBBF301 1 8890 


526 


1R4 ->2610 


2496 


FCBBF301 25460 


527 


67 -| 926 


2497 


FCRRF301 25880 


528 


185 670 


2498 


FORRF301 28420 


529 


178Q 2130 


2499 


FORRF3012Q01 0 

1 wuur JU 1 C7 U I KJ 


530 


1 84 1 236 


2500 


FCRRF301 3041 0 


531 


114Q 1874 


2501 


FCRRF301 30580 


532 


156 2123 


2502 


FPRRF^OI ^2050 


533 


71ft 1RS4 


2503 


FCBRF301 32660 


534 


86 1051 


2504 




535 


214 *>24ft^ 


2505 




536 


24 'V^R 


2506 


FCRRF301 38000 
i uurou i oouuu 


537 


646 2Q01 


2507 


FPRRF301 422Q0 


538 


67Q 1 662 


2508 


FPRRF^OI 43*^0 


539 


111 ^1Q1 




FPRRF^m 4R670 


540 


1 '^'V^ 1 RftO 
I OOO. . I oou 


2^1 fl 


FPRRF^ni 1 on 


^41 


Q74 n 1 2 


2^11 


FPRRF^fll 1 70 


*^42 


1 R 2*^07 


2*^1 O 


rODDrOu IO / c.Ikj 


R4^ 


OH, ,>gOUO 


2*>1 L 


FCBBF30161780 


544 


21. 659 


2514 


FCBBF301 64510 


545 


561.. 3035 


2515 


FCBBF301 66220 


546 


178.. 483 


2516 


FCBBF301 69280 


547 


116..901 


2517 


FCBBF301 69870 


548 


102.. 407 


2518 


FCBBF301 70710 


549 


8.. 382 


2519 


FCBBF30171230 


550 


1735..2361 


2520 


FCBBF301 72330 


551 


2497.. 2952 


2521 


FCBBF301 73960 


552 


106..>3530 


2522 


FCBBF301 75350 


553 


2. .721 


2523 


FCBBF301 77290 


554 


378.. 923 


2524 


FCBBF30179180 


555 


2382..>3452 


2525 


FCBBF301 79740 


556 


299. .721 


2526 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO of Rmino arid 




sequence 




sequence 


FCBBF30181730 


557 


43. .351 


2527 


FCBBF30 194370 


558 


698.. 1057 


2528 


FCBBF301 94550 


559 


9. .1982 


2529 


FCBBF301 95690 


560 


15. .1682 


2530 


FCBBF301 95700 


561 


14. .376 


2531 


FCBBF301 97840 


562 


406. .3015 


2532 


FCBBF301 98670 


563 


421 ..2733 


2533 


FCBBF30201 630 

1 W*l— >L_/i ^Utv ■ www 


564 


1407 2561 

1 ^^w f • • www I 


2534 


FCBBF3021 221 0 


565 


1669. .2502 


2535 


FCBBF3021 5240 


566 


774 1 586 

/ 9 "*T, i 1 w ww 


2536 


FC BBF30220050 

1 ^/l_JL~H wvJwwwwww 


567 


1431 2006 


2537 


FCBBF3022291 0 


568 


11 7. .614 


2538 


FCBBF30223 110 


569 


647.. 1129 


2539 


FCBBF3022321 0 


570 


141 ..680 


2540 


FCBBF30225930 


571 


1 66 1 956 


PR41 


FCRRF30228940 


572 


79 453 


PR4P 


FHRRF^OP^ORI 0 


R7*3 

O/ o 


qq 440 


PR4^ 


FCBBF30236670 


574 


188Q 2719 


PR44 


FPRRF^OPROQflO 


575 


214 2514 


PR4R 


FCRRF^OPRRfiflO 


576 


P7 -*P4Rn 


PR4R 


FCBBF3025737 


577 


2..1 873 


PR47 


FCRRF302RQ050 


578 


1 04 1 Ron 


PR4ft 


FHRRFRnPRDPI O 


579 




PR4Q 


FCBBF302R0480 


580 


28. .51 9 


2550 


FCRRF302fi^0fi0 

■ u u r o vj £_ uo uuu 


581 


R^R qnn 


PRR1 


FCRRF^OPfifiRI 0 


582 


//ic Q<l7ft 
't'lj. ,oo / o 


PRRP 


FCBBF30271 990 


583 


1 87.. 1 704 


2553 


FORRF^OS^RRQO 


584 


R1 -^0*\7A 
O 1 . .«>^o/*t 


PRR4 


FHRRF^nPftPOPn 


RftR 


1 P^ 1 7P1 
I ^O. . I / tL \ 


occc 


FHRRF^nPflRQ^n 
rVb/DDroutoaaou 


RftR 


ocn fiQ7 
cDU,.Dy/ 




FnRRF^nPR7Q4n 
1 w DDiOwlO / c?*+w 


RR7 

JO / 


1 R*3R 9H7Q 


PRR7 


FPRRFzinnnnRi n 

rvDDrtUUUUD 1 U 


coo 
ooo 


RRR 

ODD. . I I O 1 


OCCQ 

<cooo 


FHRRFdnom QPO 




7CO H DRP 

/ OO. . 1 UO£ 


OCCQ 


rL/DDrfUUUOUUU 


con 


R7 R 




FPRRF^nnnnzii n 

l DDIiJUUUU't i W 


RQ1 


QOA 1 oca 

you. . i ^oo 




rODDrOUUUUD 1 U 


oy*i 


ooo. .by / 




FCBBF50001650 


593 


562.. 181 5 


2563 


FCBBF50003530 


594 


127.. 921 


2564 


FCBBF50004950 


595 


21 56.. 2545 


2565 


FEBRA20005040 


596 


295.. 21 00 


2566 


FEBRA20007820 


597 


160.. 690 


2567 


FEBRA20018670 


598 


184.. 1077 


2568 


FEBRA20026820 


599 


103.. 1836 


2569 


FEBRA20027070 


600 


663.. 1736 


2570 


FEBRA20029620 


601 


565.. 1206 


2571 


FEBRA20031000 


602 


380..2551 


2572 


FEBRA20031150 


603 


2925.. 3293 


2573 


FEBRA20031280 


604 


362.. 31 24 


2574 


FEBRA20031810 


605 


1093.. 1455 


2575 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Ctone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


FEBRA20035200 


606 


2607.. 321 5 


2576 


FEBRA20035240 


607 


74.. 826 


2577 


FEBRA20038220 


608 


157.. 1305 


2578 


FEBRA20038330 


609 


161 2.. 2253 


2579 


FEBRA20038970 


610 


1279.. 2811 


2580 


FEBRA20039070 


611 


967.. 1575 


2581 


FEBRA20039260 


612 


23. .685 


2582 


FEBRA20040230 


613 


1663 2076 


2583 


FEBRA20040260 


614 


244. .561 


2584 


FEBRA20040290 


615 


1 488 2330 


2585 


FEBRA20040560 


616 


711 ..1496 


2586 


FEBRA20045380 


617 


81 ..398 


2587 


FEBRA20046200 


618 


125 2062 


2588 


FERRA20046280 


619 


6Q4 1017 


2589 


FEBRA2004651 0 


620 


R5Q 22S6 


2590 


FEBRA2005701 0 


621 


757 1107 


2591 


FFRRA20063720 


622 


11 R 1 R7R 
I I o . . I o / o 


9^Q9 


FEBRA20076200 


623 


303 680 


2593 


FEBRA200781 80 


624 


1S17 1888 


2594 


FFRRA20078800 


625 


24.. 644 




FEBRA20080860 


626 


584 241 Q 


9RQ6 


FEBRA20082660 


627 


11 4..1 793 


2597 


FEBRA2008341 0 


628 


11RQ 1^61 


2598 


FEBRA20084750 


629 


888 1 202 


2599 


FEBRA20086600 


630 


670 1 407 

O / \J . . 1 t -r\J 1 




FEBRA20087550 


631 


"1140 1814 


9601 


FEBRA2008861 0 


632 


315 818 


2602 


FEBRA2008881 0 


633 


1302 2021 




FFRRA200Q01 60 


634 




9RH4 


FERRA200Q0220 


635 


106 9971 


9Rrm 


FFRRA200Q1 620 


636 


1 U/IO. . 1 300 


9Rn« 

cOUO 


FFRRA200Q2760 


637 


6Q4 1*317 

057*+ . . I O I / 


9RH7 


FERRA200Q3270 


638 


990^ 9R07 




rCDnntUUaOcOU 












9^7 fi/17 


*LO 1 U 




R41 






FEBRA20099860 


642 


55.. 573 


2612 


FEBRA20101410 


643 


396.. 740 


2613 


FEBRA201 08020 


644 


181. .492 


2614 


FEBRA201 08580 


645 


542.. 901 


2615 


FEBRA201 15930 


646 


58. .1494 


2616 


FEBRA201 16650 


647 


178.. 573 


2617 


FEBRA20121200 


648 


972.. 1490 


2618 


FEBRA20121950 


649 


1217.. 1723 


2619 


FEBRA20141980 


650 


644.. 955 


2620 


FEBRA201 50420 


651 


314..3124 


2621 


FEBRA20151750 


652 


52..>2299 


2622 


FEBRA201 63980 


653 


93.. 1223 


2623 


FEBRA201 70240 


654 


385.. 1632 


2624 



18 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


FEBRA201 72230 


655 


377.. 1363 


2625 


FEBRA201 73330 


656 


462.-2378 


2626 


FEBRA201 75020 


657 


2040.. 2438 


2627 


F E BR A20 175330 


658 


62.. 520 


2628 


FEBRA20 177800 


659 


1918. .2289 


2629 


FEBRA201 80510 


660 


350. 889 


2630 


FEBRA201 82030 


661 


341.. 745 


2631 


FEBRA201 87460 


662 


395.. 736 


2632 


FEBRA20191720 


663 


41 7. .836 


2633 


HCHON1 0000150 


664 


197.. 670 


2634 


HCHON1 0001 660 


665 


913. .1338 


2635 


HCHON20000870 


666 


442.. 1857 


2636 


HCHON20002650 


667 


4.. 792 


2637 


HCHON2000271 0 


668 


169. .1437 


2638 


HCHON2001 5050 


669 


529.. 1 995 


2639 


H EART1 0001 420 


670 


4.. 1476 


2640 


HEART1 0001 490 

1 1 1 — 1\\ 111 \J\J\J l *Tv/V 


671 


159. .1121 


2641 


HEART20009590 


672 


1 369 1 938 


2642 


HEART2001 931 0 


673 


47.. 1693 


2643 


H EART20022200 

1 « 1 — #\ 1 1 1 C— v vL^L^L. VV 


674 


11 ..1378 


2644 


HEART20031 680 

1 1 1 1 1 L. WU 1 W-W V 


675 


1033. .301 8 


2645 


H EART20047640 


676 


595 2904 


2646 


H E ART20063 1 00 


677 


131 ..826 


2647 


H FART20082570 


678 


18.. 1022 


2648 


HHDPC1 0001 140 

III IL/ I v \ UuU 1 1 *-rv 


679 


506 1102 


2649 


HHDPC20051 850 


680 


21 ..422 


2650 


HHDPC20081230 


681 


132.. 21 95 


2651 


HHDPC20082790 


682 


150 608 


2652 


HHDPC20082970 

i ii iL/rut.uuuL.a/ u 


683 


1445.. 1771 


2653 


HHDPC200881 60 


684 


214 >2639 


2654 


HLUNG20008460 


685 


157 1818 


2655 




686 


1 0^*5 1 RS6 


2656 


hi i !MniPnnnQR5n 


687 


1 6. .657 


2657 


Ml I IMf;9finifl1^n 
n LUIMUcUU I u tou 


688 


1 UOd . . 1 HUU 


2658 


HI UNG?00112fiO 

(1 LUI N \J £.\J\J 1 l£OU 


689 


135.. 479 


2659 


hi i iNfipnnnzkin 




ovjo . . i cJ\j / 


2660 


HLUNG20011460 


691 


43. .1587 


2661 


HLUNG20012140 


692 


188. .508 


2662 


HLUNG20014590 


693 


1015..>2241 


2663 


HLUNG20015070 


694 


377.. 1945 


2664 


HLUNG20015180 


695 


468.. 1808 


2665 


HLUNG20020500 


696 


1232. .1687 


2666 


HLUNG20020850 


697 


802.. 11 79 


2667 


HLUNG20021450 


698 


753.. 1109 


2668 


HLUNG20023030 


699 


2466.. 2855 


2669 


HLUNG20024050 


700 


651.. 1568 


2670 


HLUNG20025620 


701 


1424.. 1765 


2671 


HLUNG20028110 


702 


164.. 1996 


2672 


HLUNG20029420 


703 


160.. 774 


2673 



19 



EP 1 308 459 A2 



Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


HLUNG20029490 


704 H 


135.. 563 


2674 


H LU NG20030420 


705 


116. .1906 


2675 


HLUNG20030490 


706 


553.. 1 608 


2676 


HLUNG2003061 0 


707 


920 1 465 


2677 


HLUNG20031 620 


708 


604. .1038 


2678 


HLUNG20032460 


709 


29.. 1582 


2679 


HLUNG20033060 


710 


1085..>2287 


2680 


HLUNG2003331 0 


711 


1319 1654 

1 O I C7 . . 1 UOT 


2681 


HLUNG20033350 


712 


1035..>2205 


2682 


H LUNG20034970 


713 


1 046 1 747 


2683 


HLUNG20037140 


714 


1 342 1 698 

1 0*Tt. ■ . 1 UC7U 


2684 


HLUNG200371 60 


715 


1704 3152 


2685 


H LUNG20037780 


716 


1190 1855 


2686 


HLUNG20038330 


717 


433 1 284 

"TOO . . 1 CO*T 


2687 


HI UNG20041 540 


71 8 


1 99 1 542 


2688 


HL UNIG20041 S90 


719 


1 30 =>?341 


2689 


HI LJNIG20042730 


720 


icn -i 404 

1 •sJKJ . . 1 HOH 


2690 


HI I JNG?004 c i340 


721 


1 RQ RfiR 


2691 


HI UNG 200470 70 


722 


1*310 1 R7? 

IO I U.. I U / c 


2692 


HI UMG200S0760 


723 


1811 9118 

IO IO. 1 1 o 


2693 


HI LJNGPOO^I 330 


724 


55 > 2821 


9RQ4 


HI UNIG200^2300 


725 


88 1008 


2695 


HI LJNG200S47Q0 


726 


1418 ?*vlfl 


2696 


HI UIMG200SS240 


727 


204.. 554 


PRQ7 


HI UNG200S6560 


728 


6. .51 2 


2698 


HL UNG20057380 


729 


303 788 

OUO . . / OO 




HL LJNG?00SQ?40 


730 


1 *^PQ 1 870 




HLUNG2006Q670 


731 


7cq 1S17 

/ OO . . 1 J 1 / 


2701 


HI LJNIGP00R3700 


732 


1^RR 1RR*^ 

1 OOO. . 1 DOO 




HI MMGP006R700 


733 


QO Q11 


C- i uo 


HI UMGPOOfiRQQO 


734 


OH-H-. . 1 1 


0704 

£. 1 U*t 


hi UM(^?nnfi7fti n 


735 


17R R9Q 


f Uo 


hi i JW<^?nnfi8i on 


736 


R*^ 1PR1 

OoO. . i *no i 


27fiR 
d. 1 UO 


hi I iNf^onnfiQ'^n 


/Of 


1 QR 1R71 

i C70. . IO/ 1 


c. i U / 


hi i iNf^?nn7n4i n 

nLU l\OcUU / UH- I U 


7^R 


474 Q9Q 


d. I UO 


hi I iMfi9nn79inf4 


/ OS? 


1 Rid 1 R7Q 

i oh-. . i o/y 




HLUNG20072190 


740 


1394.. 1723 


2710 


HLUNG20072450 


741 


127.. 468 


2711 


HLUNG20074330 


742 


1621..>1976 


2712 


HLUNG20079310 


743 


1484.. 2050 


2713 


HLUNG200813 


744 


123. .1910 


2714 


HLUNG20081530 


745 


1218.. 1889 


2715 


HLUNG20082350 


746 


313..2109 


2716 


HLUNG20083330 


747 


770.. 11 38 


2717 


HLUNG20083480 


748 


183. .1895 


2718 


HLUNG20083840 


749 


1389.. 1811 


2719 


HLUNG20083960 


750 


1214.. 1630 


2720 


HLUNG20084790 


751 


1185..1745 


2721 


HLUNG20085210 


752 


138. 779 


2722 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO of amino acid 




sequence 




sequence 


HLUNG20088750 


753 


1321. .1635 


2723 


HLUNG20092530 


754 


224.. 577 


2724 


HLUNG20093030 


755 


1836.. 2246 


2725 


HLUNG20094130 


756 


21 29.. 2554 


2726 


KIDNE20011600 


757 


197. .601 


2727 


KIDNE20016360 


758 


81 .2798 


2728 


KIDNE20024380 


759 


1017..>1994 


2729 


KIDN E20027980 


760 


329 1891 


2730 


KIDNE20080690 


761 


1 ..1500 


2731 


KIDNE200811 70 


762 


1356 2444 


2732 


KIDNE20083150 


763 


863 1 342 


2733 


KIDNE20083620 


764 


21 6.. 11 42 


2734 


KIDNE20084030 


765 


28.. 1572 


2735 


K 1 DN E20084040 


766 


318 926 


2736 


KIDNE20084730 


767 


580 2511 


2737 


KIDNE20084800 


768 


9 332 


2738 


KIDNE20086490 


769 


162 1919 


2739 


KIDNE20086660 

IX 1 1— / 1 N 1 f iVvUwUUv 


770 


341 ..700 


2740 


KIDN E20086970 


771 


202. .846 


2741 


KIDNF20087880 


772 


rrr 1 nn3 


2742 


K IDN E20088240 


773 


101 1135 


2743 


KIDNE20089870 

l\IL/l x L-£.vUUC7U/ V 


774 


212. .1621 


2744 


KIDNF200Q1 090 


775 


272.. 670 


2745 


KIDNF200942R0 


776 


125.. 442 


2746 


K IDN F20094R70 


777 


QQ7 


2747 


K IDN E20095530 


778 


1R73 >1878 


2748 


KIDNE201 33460 


779 


210 >1556 


2749 


KIDNF201 33880 


780 


743.. 1 057 


2750 


KIDNF20134130 


781 


1 74.. 797 


2751 


KIDNF201 34890 


782 


384.. 923 


97RP 
C- I Od- 


K IDN F201 3731 0 


783 


*3n4 ft7R 
out, .Of o 


2753 


KIDNF201 384S0 


784 


4QQ fiPft 

*+5757. .0*1.0 


2754 


K IDN F201 40870 

rv list \ \—c-\j i tuu/ \j 


785 


7RD V^POR 


2755 


KinNFom 411 5>n 


7RR 




97RR 
d. 1 JO 


rx I IN C^*J I *+ I / \J\J 


7ft7 

/Of 




07C7 


KinMFom 4pfiRn 

rviuiM c^ivj i tioou 


7RR 


! 7QR lOQQ 
j / t?0. . I COS 


07CO 

c. i OO 


KIDNE20142900 


789 


45. 764 


2759 


KIDNE20 143200 


790 


982.. 1452 


2760 


KIDNE20147170 


791 


1017.. 1493 


2761 


KIDNE20148080 


792 


1097.. 1675 


2762 


KIDNE201 49780 


793 


267.. 1370 


2763 


KIDNE201 50730 


794 


1671. .1991 


2764 


KIDNE20 152440 


795 


3. .1346 


2765 


KIDNE201 54330 


796 


422.. 271 3 


2766 


KIDNE201 54830 


797 


1588.. 1923 


2767 


KIDNE201 55980 


798 


11 64.. 1595 


2768 


KIDNE20157100 


799 


90.. 1286 


2769 . 


KIDNE201 60360 


800 


41 3.. 2692 


2770 


KIDNE201 60960 


801 


10..534 


2771 
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Table 1 (continued) 



5 



w 



15 



20 



25 



Clone name 


SEQ ID NO of nucleotide 


Position of CDS 


SEQ ID NO of amino arid 




sequence 




sequence 


KIDNE201 63710 


802 


1605.. 1973 


2772 


KIDNE201 65390 


803 


68.. 2341 


2773 


KIDNE20169180 


804 


125. .2146 


2774 


KIDNE201 70400 


805 


597 2192 


2775 


KIDNE201 73150 


806 


51 ..1052 


2776 


KIDNE201 73430 


807 


136 1296 


2777 


KIDNE201 76030 


808 


2418 2879 


2778 


KIDNE20181670 


809 


180 659 


2779 


KIDNE201 82540 


810 


135 1745 


2780 


KIDNE20186170 


811 


83.. 748 


2781 


KIDNF201 88630 

(ML/I v 1 t-J U kJkJ \J 


812 


519 9P6 


2782 


KIDNF201 89890 


813 


1 834 2445 

1 00"+. .ttto 


2783 


KIDNE201 89960 


814 


284 1 666 


2784 


KIDNF20191870 


815 


RRO 1 372 
oou. ■ 10 / c 


2785 


I IVFR20006260 


81 6 


379 1668 


2786 


LIVER20007690 


81 7 


3P9 760 


2787 


L IVER20007750 


818 


1 76 1 549 


2788 


LIVFR2001 051 0 

l_ 1 V LI ltvv 1 \J O I VS 


81 9 


757. 1 1 43 


2789 


L IVFR2001 0760 


820 


30. -OOO 


2790 


L IVFR2001 0990 


821 


30*5 1009 


2791 


LIVER20011640 


822 


1187 P086 


2792 


L IVER20013890 


823 


1^28 P199 


2793 


L IVER20026440 


824 


1019 1963 


2794 


LIVER20030650 


825 


1469 PP39 


2795 


I IVFR20032340 


826 


c IOI . .^OVJt 


^7QR 
£. / J7O 


I IVFR20038000 


8P7 


78 1 004 


57Q7 


L1VFR20040740 


828 


1 C70 . . lOf U 


2798 


L IVFR2005S270 


829 


1 48 1 347 

1 *+0. . 1 OH/ 


?7QQ 


M F^ANPOOORPOO 


R3n 


9RQO 


cDUU 


M F^ANP00071 1 0 


R31 

OO 1 


I *f U I . . I 0<lO 


£OUl 


MF^ANPOnORI RO 




1 QRR7 

I 00. .000/ 




IVI tZOMlN^lUUVJOcJHL/ 


ooo 


100 mA 

I .0 I *f 


£OUJ 


m F < ^ANPnnoQn90 


R^4 






IVICOMINtUU IO^/U 


R^^ 




none 


IVI COnIM ilUU^l I IOC 


OOO 


1 ^40 9R7R 


none 


MPQAM9nn9199H 
IVI COnlMdUUii I 


OO/ 


O/l OQOO 

y^-..^o^^i 


2807 


MESAN20021470 


838 


658. 1446 


2808 


MESAN20021860 


839 


217.. 1113 


2809 


MESAN20026870 


840 


63.. 2450 


2810 


MES AN 20027240 


841 


39.. 1940 


2811 


MESAN20027900 


842 


21 2. 3322 


2812 


MES AN 20029780 


843 


1900..2331 


2813 


MESAN20030350 


844 


142..>2239 


2814 


MESAN20030370 


845 


735.. 2462 


2815 


MESAN20030390 


846 


3. 389 


2816 


MESAN20033220 


847 


68.478 


2817 


MESAN20034440 


848 


42.. 21 83 


2818 


MESAN20038520 


849 


31. .2547 


2819 


MESAN20041380 


850 


7.. 342 


2820 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


MESAN20045750 


851 


145.. 1002 


2821 


MESAN20056890 


852 


187.. 1125 


2822 


MESAN20057240 


853 


101.. 778 


2823 


MESAN20058110 


854 


174.. 1022 


2824 


MESAN20059570 


855 


280.. 1782 


2825 


MESAN20060220 


856 


464.. 775 


2826 


MESAN20060430 


857 


1589 .3469 


2827 


MESAN20065990 


858 


21 3.. 533 


2828 


MESAN20067430 


859 


1287.. 1973 


2829 


MESAN20069530 


860 


336..>3472 


2830 


MESAN20084150 


861 


78.. 665 


2831 


MESAN20085360 


862 


11 68. .1656 


2832 


MESAN20089260 


863 


169.. 1254 


2833 


MESAN20090190 


864 


256. 2898 


2834 


MESAN20094180 


865 


1696. .2139 


2835 


MESAN20095220 


866 


1118. .1972 


2836 


MESAN20095800 


867 


31 ..1137 


2837 


NESOP20004520 


868 


109. .1512 


2838 


NESOP20005040 


869 


316. .1308 


2839 


NT2NE200 18740 


870 


1413. .1772 


2840 


NT2NE20018890 


871 


39.. 1061 


2841 


NT2NE20021860 


872 


268.. 1653 


2842 


NT2NE20026200 


873 


1888.. 2907 


2843 


NT2NE20026510 


874 


338. .1381 


2844 


NT2NE20028700 


875 


166.. 1377 


2845 


NT2NE20033150 


876 


496. .912 


2846 


NT2NE20037050 


877 


19. .411 


2847 


NT2NE20038870 


878 


190.. 1548 


2848 


NT2NE20039210 


879 


226.. 639 


2849 


NT2NE20042550 


880 


708.. 1568 


2850 


NT2NE2 0045 190 


881 


33.. 599 


2851 


NT2NE2 0047870 


882 


314. .1207 


2852 


NT2N E20053230 


883 


622 1 068 


2853 


NT2N E20053950 


884 


133 993 

1 w w . , C/ w 


2854 


NT2NE20059210 


885 


35. .391 


2855 


NT2NE20059680 


886 


120.. 443 


2856 


NT2NE20060750 


887 


239.. 928 


2857 


NT2NE20061030 


888 


160.. 600 


2858 


NT2NE20062880 


889 


201. .539 


2859 


NT2NE20064780 


890 


284.. 1771 


2860 


NT2NE20066590 


891 


821. .1204 


2861 


NT2NE20069580 


892 


1404.. 2282 


2862 


NT2NE20070520 


893 


347.. 661 


2863 


NT2NE20073650 


894 


756.. 1088 


2864 


NT2NE20077250 


895 


1815. .3185 


2865 


NT2NE20077270 


896 


250..>3642 


2866 


NT2NE20077860 


897 


589.. 1053 


2867 


NT2NE20079670 


898 


404.. 1867 


2868 


NT2NE20080770 


899 


1181 .1483 


2869 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 
sequence 


Position of CDS 


SEQ ID NO. of amino acid 
sequence 


NT2NE20082130 


900 


1026.. 1457 


2870 


NT2NE20082600 


901 


688. 1227 


2871 


NT2NE20086070 


902 


503.. 823 


2872 


NT2NE20 087270 


903 


276. .1709 


2873 


NT2N E20 087850 


904 


9. .326 


2874 


NT2NE20088030 


905 


682.. 1023 


2875 


NT2NE20092950 


906 


142.. 1884 


2876 


NT2NE20095230 


907 


547 1 602 


2877 


NT2NE201 04000 


908 


21 7.. 702 


2878 


NT2NE20 107810 


909 


29. .376 


2879 


NT2NE201 08420 


910 


1135 >2525 


2880 


NT2NF2011 1190 


911 


202.. 651 


2881 


NT2NE2011 2210 


912 


963 2231 


2882 


NT?NF?011 4R50 


913 


1441 >1794 


2883 


NT2NE2011 7580 


914 


271 ..912 


2884 


NT?N F20 1 1 99R0 


915 


445 843 


2885 


KITPNI F20 1 236 1 0 


91 6 


11 8. .717 


2886 


NTPNFP01 P4570 


91 7 


1279 1620 


2887 


NT2NF201 26030 


918 


1 054 1 548 


2888 


NT2M F20 1 27900 


91 9 


85 1 599 


2889 


NT2NE20140130 

IN 1 bll L.CU 1 *tU 1 UV 


920 


355 753 


2890 


NT2NF201402R0 


921 


126 989 


2891 


NT2NF20141 040 

IN i £.IN 1 ' 1 J 1 «+ 1 UHU 


922 


53. .811 


2892 


NT2NF201 45250 


923 


94. .531 


2893 


NT2NF201 46510 

IN 1 £1 IN at—KJ 1 HUJ 1 \J 


924 


125. .661 


2894 


NT2NF2014R690 

In 1 £- 1 N iZcHJ 1 


925 


544.. 861 


2895 


NT2NE20 149500 


926 


454 888 


2896 


In I n c<c.u I v)uu I w 


QP7 


73.. 432 


2897 


NTPNIFP01 596P0 
In 1 jcInl-^IVJ 1 OcO^U 


928 


10R0 ?51R 


2898 


NT9NJFP01 5^65>fi 

In 1 ^IMCcU 1 OOOcU 


57^57 


1R1P 
1 OO. . 1 J It 


2899 


NT9NJFP01 55650 

In 1 c-Wv-t—VJ 1 JJOOU 


930 


716 1210 


2900 


MTPW FPn 1 <%7 1 *>Ci 


931 


927.. 1 271 


2901 


In 1 ClMCcU 1 DJ 1 


932 


85. .531 


2902 


IN 1 £.INl£.U 1 Or DDU 


900 






in i ^imc^u i / oy/u 


Q^4 


974 -».91ftR 




MT9WP9ni 7701 n 
In i ^IMCdU 1 / r c IU 




I 1 o. .o / o 




NT2NE20181760 


936 


201. 785 


2906 


NT2NE20181800 


937 


784..1137 


2907 


NT2NE201 84720 


938 


1033.. 1506 


2908 


NT2RI20016240 


939 


359.. 748 


2909 


NT2RI20021200 


940 


534.. 875 


2910 


NT2RI20033920 


941 


269.. 1444 


2911 


NT2RI20093010 


942 


1478. .1816 


2912 


NT2RP70001120 


943 


172. .1518 


2913 


NT2RP70001 730 


944 


166.. 1935 


2914 


NT2RP70003110 


945 


15. .1868 


2915 


NT2RP70012830 


946 


280.. 2337 


2916 


NT2RP70022820 


947 


2819.. >3760 


2917 


NT2RP70027790 


948 


389.. 2965 


2918 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


NT2RP70029780 


949 


156.. 1349 


2919 


NT2RP70030840 


950 


278.. 2983 


2920 


NT2RP70031070 


951 


148.. 1740 


2921 


NT2RP70031340 


952 


107.. 1576 


2922 


NT2RP70031480 


953 


203.. 2947 


2923 


NT2RP70035110 


954 


106. .765 


2924 


NT2RP7004641 0 


955 


155. .1582 


2925 


NT2RP70049610 


956 


834.. 1889 


2926 


NT2RP70056290 


957 


2748.. 31 82 


2927 


NT2RP70056690 


958 


253..>3053 


2928 


NT? RP70057500 


959 


216. .2615 


2929 


NT2RP70064570 

IN 1 CI 1 1 / \J\J V*"TO 1 V 


960 


636.. 2 162 


2930 


NT2RP70074800 


961 


199 684 


2931 


NT2RP70075300 


962 


226 1539 


2932 


NT2RP70075800 

IN 1 CI II / / OU UVs 


963 


211 3801 


2933 


NT2RP70080150 

IN 1 CI II / WUUU IOU 


964 


306.. 674 


2934 


NT? RP 70 084540 

IN 1 CI ll / WO*"tO*TVJ 


965 


8. .406 


2935 


NT?RP70087140 

in i ci ir # v<_» # i *ty 


966 


175.. 1074 


2936 


NT2RP70090870 

IN 1 C 1 II / VJUC7UO » W 


967 


272.. 261 7 


2937 


NTONG?000??30 


968 


50 >3211 


2938 


NTONG2000531 0 


969 


88.. 471 


2939 


NTONG2001 7620 

IN 1 VJINV3CVU 1 / OCU 


970 


40.. 432 


2940 


NTONG?0029850 

IN 1 VJ|\|V3cUUcCOJv 


971 


110 1258 


2941 


NTONG?0031 580 

IN 1 UI1IVJL.UUJ 1 \JUU 


972 


126. .863 


2942 


NTONG?003?1 00 

IN 1 VJIMVJICUUiJC 1 VU 


973 


65.. 11 32 


2943 


NTONG? 0034540 


974 


163 2523 


2944 


NTONG?00351 50 

IN 1 VJIMVJCUUxJJ IOW 


975 


543.. 2477 


2945 


NTONG?0043080 

IN 1 UINVjcUUtOUOv 


976 


22.. 2679 


2946 


NTONG20048440 


977 


89 1 849 


2947 


NTONG?004Q1 80 

IN 1 VJINVjlcUy+S 1 ou 


978 


1 55 1 300 

1 OO . . 1 O \J \J 


2948 


NTONG?0 05^610 


979 


321 3821 

OC 1 . . OUC 1 


2949 


NTONG?0051730 


980 


121 ..1 701 


2950 


NTONG?0051Q1 0 

IN 1 \-J 1 N VJICUvJOO v7 1 KJ 


981 


47.. 2758 


2951 


in i uinucuujdcuu 


Q8? 


1 ?? 1 QfiQ 

1 CC. . 1 


2952 


NTONri?on5flni o 

In 1 UIMUcUUOOU 1 \J 


983 


?37 1 55Q 

CO / . . 1 OOC7 


2953 


IN 1 UIMUcUUOOccU 


QR4 
yon 


fi7 1^14 


?Q54 


OCBBF20000740 


985 


165.. 2402 


2955 


OCBBF20001780 


986 


1990.. 2334 


2956 


OCBBF20005220 


987 


590.. 2299 


2957 


OCBBF20009820 


988 


281. .823 


2958 


OCBBF20011860 


989 


145.. 528 


2959 


OCBBF20012520 


990 


786.. 3023 


2960 


OCBBF20016390 


991 


667.. 161 7 


2961 


OCBBF20016810 


992 


123.. 2351 


2962 


OCBBF201 09450 


993 


204.. 506 


2963 


OCBBF201 09780 


994 


253. 573 


2964 


OCBBF201 10210 


995 


8.. 1072 


2965 


OCBBF201 10730 


996 


1346.. 1666 


2966 


OCBBF20111370 


997 


296.. 1345 


2967 
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Table 1 (continued) 



5 



10 



15 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


OCBBF20111600 


998 


375. 1823 


2968 


OCBBF20112280 


999 


447.. 860 


2969 


OCBBF201 12320 


1000 


72. .857 


2970 


OCBBF20113110 


1001 


292.. 738 


2971 


OCBBF201 15360 


1002 


1182. .2126 


2972 


OCBBF201 16250 


1003 


124. .2175 


2973 


OCBBF201 17220 


1004 


605. .937 


2974 


OCBBF201 18720 


1005 


131 ..451 


2975 


OCBBF20119810 


1006 


1483.. 2478 


2976 


OCBBF201 20010 


1007 


216. .1106 


2977 


OCBBF201 20950 


1008 


91. .1629 


2978 


OCBBF20121910 


1009 


235.. 2580 


2979 


OCBBF201 23200 


1010 


1046 1411 


2980 


OCBBF201 42290 


1011 


94.. 900 


2981 


OCBBF20 147070 


1012 


1193 3256 


2982 


OCBBF201 52330 


1013 


284. .613 


2983 


OCBBF201 55030 


1014 


188. .724 


2984 


OCBBF201 56450 


1015 


104..1132 


2985 


OCBBF201 57970 


101 6 


1390 2313 


2986 


OCBBF201 60380 


1017 


1176 3884 


2987 


OCBBF201 65900 


1018 


407.. 934 


2988 


OCBBF201 6591 0 


1019 


1260 2495 


2989 


OCBBF201 66890 


1020 


201 ..1121 


2990 


OCBBF201 66900 


1021 


350 >2606 


2991 


OCBBF201 67290 


1022 


186 2858 


2992 


OCBBF201 70350 


1023 


8. .499 


2993 


OCBBF201 74580 


1024 


179.. 1294 


2994 


OCBBF201 74890 


1025 


1 78 1 380 


2995 


OCBBF201 75360 


1 026 


iQQ4 1711 

1 OJ7*T. .1/11 


2996 


OCBBF201 76650 


1027 


121 ..423 




OCBBF201 77540 


1 028 


334 1 242 




OCBBF201 7791 0 


1 029 


1 047 1 ftQQ 

1 ut / . . 1 ODD 


QQQQ 


OCBBF201 82060 


1 030 


Q64 1 827 


3000 


OCRBF201 R5630 


1 031 


1R3Q 5>1Q^ 


OUU 1 


OCBBF201 88280 


1 032 


\J/—£L . . U C7 L> 


3002 




1 033 


47 ?3n^ 


ouuo 


PANCR1 0000860 


1034 


20 .376 


3004 


PEBLM1 0001 470 


1035 


155.. 1960 


3005 


PEBLM20001800 


1036 


59.. 1549 


3006 


PEBLM20003260 


1037 


660.. 1121 


3007 


PEBLM20005020 


1038 


315..647 


3008 


PLACE50001290 


1039 


620.. 1030 


3009 


PLACE50001390 


1040 


156.. 1376 


3010 


PLACE60001910 


1041 


1521.. 2252 


3011 


PLACE60004260 


1042 


562.. 924 


3012 


PLACE60006300 


1043 


12.. 725 


3013 


PLACE60011180 


1044 


526..894 


3014 


PLACE60012620 


1045 


106.. 1368 


3015 


PLACE60017120 


1046 


1116.. 1454 


3016 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


PLACE60052940 


1047 


385..>2108 


3017 


PLACE60053280 


1048 


161..>2687 


3018 


PLACE60054230 


1049 


189.. 1550 


3019 


PLACE60054820 


1050 


298.. 852 


3020 


PLACE60054870 


1051 


105. .2213 


3021 


PLACE60055350 


1052 


1188. .1562 


3022 


PLACE60055460 


1053 


160. .1197 


3023 


PLACE60055590 


1054 


123 1406 


3024 


PLACE60056910 


1055 


1028.. 1549 


3025 


PLACE60057860 


1056 


1617. .1940 


3026 


PLACE60061370 


1057 


1 ..1899 


3027 


PLACE60062660 


1058 


97 954 


3028 


PLACE60062870 


1059 


604 1 278 


3029 


PLACE60063940 


1060 


459 797 


3030 


PLACE60064180 


1061 


167.. 1240 


3031 


PLACE60064740 


1062 


302 952 


3032 


PLACE60066970 


1063 


6Q7 1 QQR 


3033 


PLACE6006871 0 


1064 


363 1 1 39 


3034 


PLACE60069880 


1 065 


898 1 374 


3035 


PLACE60070500 


1066 


765 1718 


3036 


PLACE60071 800 


1067 


689 1333 

» ■ 1 www 


3037 


PLACE60072390 


1068 


145.. 522 


3038 


PLACE60072420 


1069 


289 684 


3039 


PLACE60073090 


1070 


384 1814 


3040 


PLACE60074820 


1071 


38.. 727 


3041 


PLACE60077870 


1072 


2280 2609 


3042 


PLACE60080360 


1073 


285 938 


3043 


PL ACE60081 260 


1074 


1 1 <J£~ . . 1 ouw 


°.n44 


PLACF60082850 


1 075 


003 ..II vJ/i 




PLACE60087680 


1076 


3 33Q 


°.n4R 

OUHO 


PLACE60088240 


1 077 


94 570 


out/ 


PLACE60092280 


1 078 


180 8RQ 

I OU . .OD3 


°.H4ft 

OUHO 


PLACE60092370 


1 079 


371 ..691 


^n4Q 


PI AOFROOQ^SO 


1 080 


4P.°. 1 °.q«r 


ouou 


PI APFfi00Q^240 


1 081 


aOD. . 1 J l£ 


OUO I 


pi Af^FRnnQRRnn 


1 082 






PLACE60098350 


1083 


86.. 1852 


3053 


PLACE601 04630 


1084 


1267.. 1671 


3054 


PLACE601 05680 


1085 


1290. .1631 


3055 


PLACE601 07010 


1086 


128.. 829 


3056 


PLACE601 09910 


1087 


4.. 567 


3057 


PLACE601 13340 


1088 


127.. 1518 


3058 


PLACE601 18810 


1089 


101. .1657 


3059 


PLACE601 19700 


1090 


199..516 


3060 


PLACE601 20280 


1091 


61.. 705 


3061 


PLACE601 22970 


1092 


861..>1396 


3062 


PLACE601 32200 


1093 


240.. 1082 


3063 


PLACE601 32320 


1094 


471. .1121 


3064 


PLACE601 32880 


1095 


495. .1406 


3065 
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Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


PLACE601 38840 


1096 


729. 1553 


3066 


P LAC E60 140640 


1097 


840.. 1919 


3067 


PLACE60150510 


1098 


21 5.. 1246 


3068 


P LAC E60 154450 


1099 


25.. 873 


3069 


PLACE601 55910 


1100 


255. 701 


3070 


P L AC E60 157310 


1101 


1156.. 1482 


3071 


PLACE60162100 


1102 


885.. 1226 


3072 


P L AC E60 175640 


1103 


242.. 556 


3073 


PLACE601 77880 


1104 


104. 1327 


3074 


PLACE601 77910 


1105 


59.. 442 


3075 


PLACE60181870 


1106 


131. .673 


3076 


PLACE601 84410 


1107 


995.. 1483 


3077 


PLACE601 84870 


1108 


86..>1402 


3078 


PLACE601 88630 


1109 


571.. 1395 


3079 


PROST1 0001 100 


1110 


716. .1219 


3080 


PROST1 0001 360 


1111 


1506.. 1997 


3081 


PROST1 00021 50 


1112 


840.. 2345 


3082 


PROST20007170 


1113 


216. .1679 


3083 


PROST20007600 


1114 


29.. 742 


3084 


PROST20011160 


1115 


150.. 548 


3085 


PROST20011800 


1116 


1297.. 1629 


3086 


PROST20014140 


1117 


1901 ..3022 


3087 


PROST20014150 


1118 


1632.. 1961 


3088 


PROST200 14650 


1119 


1243.. 1653 


3089 


PROST200 15210 


1120 


711. .1499 


3090 


PROST200 15400 


1121 


49.. 399 


3091 


PROST200 16760 


1122 


62.. 1870 


3092 


PROST20022120 


1123 


301. .762 


3093 


P ROST20024250 


1124 


38.. 445 


3094 


P ROST20028970 


1125 


30. .821 


3095 


P ROST20033240 


1126 


39 1 682 


3096 


PROST200351 70 


1127 


689 1 078 


3097 


P ROST20035830 


1128 


1154 1468 


3098 


P ROST20036280 


1129 


1 801 


3099 


P ROST20036350 


1130 


22. .21 75 


31 00 


P ROST20039300 


1131 


87.. 635 


31 01 


PROST20041460 


1132 


427.. 855 


3102 


PROST20042700 


1133 


461. .1081 


3103 


PROST20045700 


1134 


472.. 975 


3104 


PROST20047440 


1135 


1058.. 1441 


3105 


PROST20048170 


1136 


73.. 555 


3106 


PROST20050390 


1137 


1358.. 2035 


3107 


PROST20051310 


1138 


1913..2737 


3108 


PROST20052720 


1139 


1195. .1515 


3109 


PROST20052850 


1140 


430..1185 


3110 


PROST20054660 


1141 


55. 753 


3111 


PROST20058860 


1142 


506..892 


3112 


PROST20060200 


1143 


103.. 429 


3113 


PROST20062820 


1144 


415.. 1215 


3114 
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Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


PROST20063430 


1145 


126. .1199 


3115 


PROST20065100 


1146 


95.. 1522 


3116 


PROST20065790 


1147 


225.. 2555 


3117 


PROST20073280 


1148 


891.. 1277 


3118 


PROST20075280 


1149 


97. .885 


3119 


PROST20078710 


1150 


347.. 1213 


3120 


PROST20082430 


1151 


2077.. 2436 


3121 


P ROST20084470 


1152 


728 1 498 


3122 


PROST20084680 


1153 


395.. 733 


3123 


PROST20084720 


1154 


1528.. 1839 


3124 


P ROST20087240 


1155 


766 1 452 


3125 


PROST20093470 


1156 


1 288 1 647 


3126 


P ROST20094000 


1157 


376 1011 


3127 


PROST2009731 0 


1158 


1502.. 1903 


3128 


P ROST20097360 


1159 


99.. 602 


31 29 


PROST20097840 


11 60 


687 128Q 


3130 


P ROST20099090 


1161 


85.. 792 


3131 


PROST201 021 90 


1162 


86. .406 


31 32 


PROST201 02500 


11 63 


1467 2051 


3133 


PROST201 03820 


11 64 


1770 2078 


3134 


PROST201 05450 


11 65 


1027 1911 


3135 


PROST201 06060 


1166 


97.. 705 


3136 


PROST201 08850 


11 67 


1 1 73 1 664 


3137 


P ROST20 110120 


11 68 


804 1 1 39 


31 38 


PROST201 1 41 00 


1169 


74.. 772 


3139 


PROST201 20070 


1170 


442 1 593 


3140 


PROST201 21 570 


11 71 


673 1 1 85 


31 41 


P ROST20 1 22490 


1172 


1 0..753 


3142 


P ROST20 1 24000 


1173 


943. .1371 


3143 


P ROST20 1 25420 


11 74 


1480 1Q68 

1 tOU.. 1 C7UO 


3144 


P ROST20 1 27450 


11 75 


255 R57 


31 45 


P ROST20 1 30320 


11 76 


54Q 230Q 


31 46 


PROST201 38730 


11 77 


494. .829 


^1 47 


PROST901 465Q0 


1178 


O I O. .£.00 I 


o I *+0 


PROST201 51 370 


1 1 79 


QQ 4R2 


^1 AO 


PROSTP01 59510 


1 1 80 


1 UcfO.,*L 1 DO 


O I ou 


PROST20 152870 


1181 


258.. 1322 


3151 


PROST20 155370 


1182 


606.. 2255 


3152 


PROST20 156360 


1183 


1557.. 1898 


3153 


PROST20 159320 


1184 


67.567 


3154 


PROST20 168600 


1185 


336.. 1700 


3155 


PUAEN 10000650 


1186 


341. .943 


3156 


PUAEN 10000870 


1187 


1676.. 2029 


3157 


PUAEN 1 0001 640 


1188 


332.. 11 38 


3158 


PUAEN20000800 


1189 


327.. 1811 


3159 


PUAEN20001520 


1190 


155.. 1243 


3160 


PUAEN20002470 


1191 


3. .2417 


3161 


PUAEN20003120 


1192 


93.221 6 


3162 


SALGL1 0001 070 


1193 


89..997 


3163 
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Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


SKMUS20006790 


1194 


389. .1330 


3164 


SKMUS20007260 


1195 


47..>1320 


3165 


SKMUS20008730 


1196 


91. .1407 


3166 


SKMUS20017400 


1197 


84. .815 


3167 


SKMUS20020770 


1198 


179. .781 


3168 


SKMUS20026340 


1199 


57. 1202 


3169 


SKMUS20040440 


1200 


27. .1091 


31 70 


SKMUS20064810 


1201 


91 ..459 


31 71 


SKMUS20073150 


1202 


17..>822 


3172 


S KMUS20073590 


1203 


438.. 824 


31 73 


SKMUS200791 50 


1204 


165.. 1235 


31 74 


SKMUS20091 900 


1205 


57.. 359 


31 75 


SKNMC1 0001 230 


1206 


87.. 1814 


31 76 


SKNMC20006350 


1207 


326 1738 


31 77 


SKNSH1 0001 010 


1208 


1189 1530 


31 78 


SKNSH20007160 


1209 


113. .562 


31 79 


SKNSH20009710 


1210 


71 .622 


31 80 


SKNSH20030640 


1211 


1635 2090 


31 81 


SKNSH20040390 


1212 


246.. 911 


31 82 


SKNSH20052400 


1213 


926 2017 


31 83 


SKNSH20057920 


1214 


369 1352 


31 84 


SKNSH20068220 


1215 


504.. 1235 


31 85 


SKNSH20094350 


1216 


70.. 591 


3186 


SMINT20000070 


1217 


752 1858 


31 87 


S M 1 NT20002320 


1218 


607 1761 


31 88 


SMINT20006020 


1219 


1215 2348 


31 89 


SMINT20006090 


1220 


75.. 602 


31 90 


S MINT? 00 074 70 


1221 


154 16S0 


31 91 


SMINT20008110 


1222 


784 1 S57 


3192 


SMINT20011830 


1223 


34 ..447 


31 93 


SMINT20011950 


1224 


732 2006 


31 94 


S M 1 NIT200 1 2220 

Kj I V I 1 1 tJ 1 £- \J\J 1 £-C-t-\J 


1225 


p^Q gig 


31 95 


S M 1 NT200 1 3970 

O I VI 1 1 M I C—\J\J 1 Uv7 / U 


1226 


1117 1686 

1 1 1 / . . 1 UUU 


31 96 


^MiWTpnm dRi n 


1 ooi 

1 § 


cLO § . . O If 


11 Q7 

U 1 91 


^MIIMTPOOI 61 SO 

OIVIIIN 1 1 U \ JU 


1228 


Q11 167R 


11 Qft 


SMIIMTP001 711 0 

O IVI UN 1 C\J\J \ 1 O IU 


1 229 


1?9ft 1R61 


11 QQ 


SMINT20021260 


1230 


1066.. 1407 


3200 


SMINT20023110 


1231 


1696. .2073 


3201 


SMINT20024140 


1232 


31.. 555 


3202 


SMINT20026200 


1233 


323.. 21 70 


3203 


SMINT20028800 


1234 


476.. 1696 


3204 


SMINT20028840 


1235 


89.. 823 


3205 


SMINT20030740 


1236 


172.. 1833 


3206 


SMINT20031280 


1237 


542.. 1489 


3207 


SMINT20035050 


1238 


227.. 21 49 


3208 


SMINT20035510 


1239 


168..>1891 


3209 


SMINT20036440 


1240 


207.. 2084 


3210 


SMINT20038660 


1241 


289.. 1449 


3211 


SMINT20039050 


1242 


175..>2338 


3212 
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Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


SMINT20043390 


1243 j 


1465.. 2094 


3213 


SMINT20044140 


1244 


30. .1718 


3214 


SMINT20044730 


1245 


45. .1787 


3215 


SMINT20045470 


1246 


1156. .1539 


3216 


SMINT20045830 


1247 


287.. 1882 


3217 


SMINT20045890 


1248 


893.. 1372 


3218 


SMINT20047290 


1249 


2..637 


3219 


S M 1 NT20048720 


1250 


1834 >2374 


3220 


S M 1 NT20049920 


1251 


1384.. 1860 


3221 


SMINT20052130 


1252 


271 ..891 


3222 


SMINTP0OS4050 


1253 


361 1839 


3223 


SMINTP0056P30 

OIVIIIM I ilUU JUiCOL' 


1254 


73 1674 


3224 


SMINTP0056P40 

OIVIIIm 1 LUUJULtU 


1255 


1108 1533 


3225 


SMINTP006P0R0 

OIVIIIM 1 ilUUL)£UJiJ 


1256 


75 1 964 


3226 


iwi i ntp o n R70 r o 

OIVI 1 1 M 1 £UUO / uou 


1257 


13R pn7 c » 


3227 


9MINTP0070RP0 


1258 


44 445 


3228 


SMINTP0074330 


1259 


71 ..532 


3229 


SMINTP00779P0 

OIVIIIM 1 {—\J\Ji 1 \J C\J 


1 260 


843 1 1 66 

U^v. , 1 IUU 


3230 


SMINTP0077QR0 


1261 


1010 P467 


3231 


SMINTP0081 330 


1262 


250 618 


3232 


SMINTP0083PQ0 


1263 


80 1 606 


3233 


S M 1 NTP0084Q 1 0 

OIVIIIM 1 CVvU^V i V/ 


1264 


664 Q87 


3234 


R M 1 NTPOORRI 1 n 

OIVIIIM 1 C\J\J\J^J'-J 1 V/ 


1265 


1PQ 500 


3235 


c ^MINTP00RR4 c ;n 

OIVIIIM 1 cl/UOOf ou 


1266 


66. .437 


3236 


9 M 1 NTP00R6PS0 

OIVIIIM 1 £-\J\J \J\J C--*J\J 


1267 


1 QRR PPR7 


3237 


C ^MINTP00R67P0 

OIVIIIM 1 L.UUUU/ i_ Vy 


1268 


P33 1003 


3238 


qiui | NT200R8440 

OIVIIIM 1 tDUOOttU 


1269 


31 ..438 


3239 


OIVIIIM 1 dUUOOOSU 


1 P7f) 


PR RRO 




OIVIIIM 1 ^UUOifC 1 U 


1 P71 

14./ 1 


1PRQ 1R7P 


O^H- I 


OIVIIIM I ^UUOaDUU 


1 P7P 


I . . I OOJ 




RMiMTPnnoi 1 Qn 

OIVIIIm I C.\J\}\3 I 1 i?\J 


1 P7*} 


PQO 1 7RR 

£9U.. I / DO 




RMiNTpnnQPi pn 

OIVIIIM 1 tLV\J<3tL 1 jiU 


1P74 


OO-.t I o 




^MiMTpnnQPi fin 

OIVIIIM 1 c.\J\Ji3£. 1 QU 


1 P7^ 


7RO 




q n/i I mtp n n q ^ r q n 

OIVIIIM 1 ^UUyoDJU 


1 P7R 


1 RPP OOAi 




oivii im i cuuyf i ou 


1 P77 


o i i ..you 




oiviiiM i tuuyHoou 




OR /CO 




SPLEN20005160 


1279 


2342. .2659 


3249 


SPLEN20005370 


1280 


1497.. 1856 


3250 


SPLEN20006950 


1281 


702.. 1343 


3251 


SPLEN20011350 


1282 


1127.. 1537 


3252 


SPLEN20012450 


1283 


134.. 442 


3253 


SPLEN20015030 


1284 


376.. 945 


3254 


SPLEN20015100 


1285 


1367..2137 


3255 


SPLEN20016500 


1286 


328.. 816 


3256 


SPLEN20017610 


1287 


940.. 1314 


3257 


SPLEN20017810 


1288 


101. .556 


3258 


SPLEN20019120 


1289 


176.. 487 


3259 


SPLEN20020530 


1290 


358. .762 


3260 


SPLEN20023430 


1291 


1067.. 1426 


3261 
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Clone name 


SEO ID NO of nurlpotirip 


Position of CDS 


SFO IO NO of amino arirl 




sequence 




sequence 


S PLEN20023540 


1292 


644 1 903 


3262 


SPLEN20023850 


1293 


467 >1879 


3263 


SPLEN20024190 


1294 


319 1368 


3264 


SPLEN2002451 0 


1295 


1070 1423 


3265 


S PL EN20024620 


1296 


3. .1514 


^Pfifi 

ocoo 


S PLEN20024770 


1297 


1 34 2353 


3267 


S PLEN2002493Q 


1298 


1150 3069 


3268 


SPLEN200291 70 


1299 


57 750 


^PRQ 
04.09 


SPLFNP003R780 


1300 


1 43P 1 7SP 


O4C f u 


SPl FN2003Q180 


1301 


^RQ 1 ^4 


^P71 
0^ / 1 


SPl FN20040780 


1302 


1 R7*^ P^4R 


^P7P 


qpi FNP0041A10 

orLLINtvUH IOIU 


1303 


ftm 114P 


0^ / 0 


qpi FNP004PP00 

O i l_C_ IN^.UU < + 4.4CUU 


1 ^04 

1 out 


1 1 r_ O . . 1 ,;00 


O^ / *+ 


O II LI 1 NUVJU'tO'tOv 


1305 


224 565 


O^ /O 




1 OUD 


^ 1 ftQQ 

OO . . I OiJS? 


0<£ / O 


spi FMPno/iofinn 

O rl_d\l4.UU*+OOOU 


1 m~7 

I OU / 


1 qOT 0071 
I OO / ..4.0/ I 


0077 


SPl FNPOOd^^O 


1 ouo 


^ 1^1^ 
0. . 10 10 


O.OVQ 
Od. / 0 


spi FNPnn4RRnn 


1 OU57 


0000 P*3RP 


O^ / 57 


SPl FNPn04QR4n 


1 ^1 n 

IO 1 u 


ftp f \ r K~?R. 
04. . .00 / 0 


ooon 
O^OU 


qpi FNPoo^noon 


1311 


41 ft PPOP 


O^O 1 


spi FNpnnRi4Pn 


1 *31P 


ROC 1 QQO 

000.. 1 yoo 


OOftO 


SPI FNPnO*i41fiO. 


1 O I o 


1 OD..ilUDy 


OOQO 
O^OO 


spi FNPnnR4^nn 


1314 


RR7 1 44_q 


OOR/1 
O^O** 


spi FNPnn^^Rnn 


1 qi 


1 4R 1 nqq 
1 fu . . 1 UOO 


04:00 


spi FMpnn^7fl^n 


1 R 

IO 1 o 


R(Y7 11RO. 
OU /..II OO 


04DD 


spi FNPo.oc;7Qno. 

OrLCINcUUO / S7UU 


H017 
IOI / 


1 ftftO OAOI 
1 0OO..4.*K5l 


0007 


spi FNPnn^msn 


1 ^1 ft 

1 O 1 o 


qoo 11 on 

OUO. . I 1 4.U 


04:00 


spi FMpnn^Qpyn 


io iy 


1 04... 1 OOO 


04. oy 


qpi pMonnfiOft^n 

OrLCIMtUUOtOOU 


I OcU 


OOO TftC 


OOQO 

o4.yu 


O ~I_CIN*1UU00/10U 


1 qoi 
1 04. 1 


Ol O.. 1 OOO 


04. y 1 


spi FMpnoR^ftoo. 


1 ooo 


"f OO QQR 

1 00. .yyo 


OOQO 




1 OOO 
1 04.0 


07no _ noon 
^ / u*i . . >o U<^o 


0000 

0430 


QPI PM9nr>71ft9fl 


1 OO/l 
I 04! 4 !- 


0/0. .1 oy<c 


04 y 4 


qpi PM9fin7<i^fin 


1 OOC 
1 04.0 


1 / 4..4T4: 1 U 


OOQC 

04 yo 


qpi PMono70pao 


I 04.0 


<i^u..*ioyo 


OOQCJ 

04^0 


QDI cmooo7*51qo 


lO<l/ 


on OO -1 
^U..oo 1 


3297 


SPLEN20076470 


1328 


575.. 1714 


3298 


SPLEN20080070 


1329 


5.. 1339 


3299 


SPLEN20081640 


1330 


946.. 1308 


3300 


SPLEN20085910 


1331 


1084.. 1434 


3301 


SPLEN20087370 


1332 


2670.. 3098 


3302 


SPLEN20087860 


1333 


255.. 755 


3303 


SPLEN20090880 


1334 


1742.. 21 94 


3304 


SPLEN20098030 


1335 


268.. 1077 


3305 


SPLEN201 00040 


1336 


101. .862 


3306 


SPLEN20101950 


1337 


122.. 1198 


3307 


SPLEN201 04150 


1338 


1146. .2219 


3308 


SPLEN201 04690 


1339 


661. .990 


3309 


SPLEN20105100 


1340 


1425.. 1859 


3310 
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Table 1 (continued) 



5 



w 



15 



20 



25 



Clone name 


SEQ ID NO, of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


SPLEN201 08000 


1341 


887.. 1351 1 


3311 


SPLEN201 08460 


1342 


2254.. 2607 


3312 


SPLEN20110180 


1343 


147.. 1292 


3313 


SPLEN201 10210 


1344 


1831 ..2271 


3314 


SPLEN201 10860 


1345 


1489.. 1887 


3315 


SPLEN20111450 


1346 


2768.. 3208 


3316 


SPLEN201 14190 


1347 


154.. 1440 


3317 


SPLEN201 16720 


1348 


9.. 1583 


3318 


SPLEN20117580 


1349 


21 8.. 1240 


3319 


SPLEN20118050 


1350 


145.. 11 22 


3320 


SPLEN20121790 


1351 


569.. 1 054 


3321 


SPLEN201 25230 


1352 


971 ..1324 


3322 


SPLEN20126110 


1353 


254. .991 


3323 


SPLEN201 35030 


1354 


40.. 1848 


3324 


SPLEN201 36700 


1355 


1296 1628 


3325 


SPLEN201 36730 


1356 


1312 1644 

I w It-.. 1 \J I T 


3326 


S PLEN201 37530 


1357 


2676 3221 


3327 


S PLEN201 38600 


1358 


1166 1582 

I 1 w W . > 1 JUL 


3328 


SPLEN20139100 


1359 


63 1532 


3329 


SPLEN201 39360 


1360 


763 1 827 


3330 


SPLEN201 75920 


1361 


1123 1590 


3331 


SPLEN201 76130 


1362 


16.. 549 


3332 


SPLEN201 77400 


1363 


1078 1431 


3333 


SPLEN201 80980 


1364 


41 0..823 


3334 


SPLEN20181570 


1365 


214 >2609 


3335 


SPLEN201 82850 


1366 


168 659 


3336 


SPLEN201 82990 


1367 


55 1881 


3337 


SPL EN201 83020 


1368 


265.. 723 


3338 


SPLEN201 83950 


1369 


402.. 746 


3339 


SPLENI201 87490 


1370 


98 >2458 


3340 




1371 


Q14 1^4? 




SPLEN201 90430 


1372 


884 >1Q42 


3342 


SPLEN201 90770 


1373 


97 771 


3343 


SPLEN201 91 020 


1374 


140.. 1426 


HAA 


^PLFN201 92570 


1375 






Or LCIMcU 1 oOcOU 


1 17FI 


OCQ COO 
COO..OOO 




SPLEN201 93490 


1377 


321. .854 


3347 


SPLEN201 93750 


1378 


845.. 1306 


3348 


SPLEN201 93790 


1379 


290.. 2278 


3349 


SPLEN201 95710 


1380 


64.. 51 6 


3350 


SPLEN201 97090 


1381 


11. .550 


3351 


SPLEN201 97740 


1382 


2.499 


3352 


SPLEN201 97930 


1383 


803..>1625 


3353 


SPLEN201 98390 


1384 


14.. 1873 


3354 


SPLEN201 99850 


1385 


575.. 1009 


3355 


SPLEN20200070 


1386 


353.. 655 


3356 


SPLEN20200340 


1387 


1047.. 1814 


3357 


SPLEN20201830 


1388 


143.. 1177 


3358 


SPLEN20203590 


1389 


160..471 


3359 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


SPLEN20204670 


1390 


6.. 311 


3360 


SPLEN20205120 


1391 


682.. 1542 


3361 


TESOP1 0000350 


1392 


1646..2101 


3362 


TESOP10001600 


1393 


282.. 1352 


3363 


TEST) 100001 90 


1394 


84.. 2042 


3364 


TEST1 10000850 


1395 


925. .1311 


3365 


TESTI10001570 


1396 


141. .1919 


3366 


TESTI20004310 


1397 


55.. 2484 


3367 


TESTI20005980 


1398 


1350..>1B66 


3368 


TESTI20006160 


1399 


53.. 1720 


3369 


TESTI20006830 


1400 


313. .1719 


3370 


TESTI20012080 


1401 


89. .1867 


3371 


TESTI20012360 


1402 


178.. 594 


3372 


TEST120016970 


1403 


133.. 1842 


3373 


TESTI20019590 


1404 


210. .752 


3374 


TESTI20028020 


1405 


201.. 1298 


3375 


TESTI20029100 


1406 


1743.. 21 47 


3376 


TESTI20030200 


1407 


94. .2091 


3377 


TESTI20030440 


1408 


60. .1931 


3378 


TESTI20030610 


1409 


547.. 1371 


3379 


TESTI20031310 


1410 


889.. 2235 


3380 


TESTI20031410 


1411 


201 ..1655 


3381 


TESTI20032770 


1412 


163.. 1002 


3382 


TESTI20034750 


1413 


472.. 804 


3383 


TESTI20035330 


1414 


13. .558 


3384 


TESTI20035790 


1415 


16. .21 63 


3385 


TESTI20038240 


1416 


163.. 2502 


3386 


TESTI20040850 


1417 


52.. 669 


3387 


TESTI20041630 


1418 


330. 1526 


3388 


TESTI20043130 


1419 


410. .715 


3389 


TESTI20043180 


1420 


68. .2515 


3390 


TESTI20043220 


1421 


1496.. 1957 


3391 


TESTI20043910 


1422 


37 .1728 


3392 


TESTI20043990 


1423 


311. .1153 


3393 


TESTI20044900 


1424 


234.. 1595 


3394 


TFSTI20045390 


1425 


156 2108 


3395 


TEST120045740 


1426 


199..549 


3396 


TESTI20046110 


1427 


24.. 1826 


3397 


TESTI20046490 


1428 


554.. 2542 


3398 


TEST120046540 


1429 


123.. 2309 


3399 


TESTI20046870 


1430 


278.. 1645 


3400 


TESTI20046890 


1431 


276.. 2702 


3401 


TESTI20047370 


1432 


2..2134 


3402 


TESTI20047930 


1433 


239. .1843 


3403 


TESTI20049060 


1434 


1191 ..1784 


3404 


TESTI20049410 


1435 


281. .21 70 


3405 


TESTI20049990 


1436 


355..915 


3406 


TESTI20050170 


1437 


31 4.. 808 


3407 


TESTI20050400 


1438 


111. .440 


3408 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


TESTI 20050720 


1439 


68.1621 


3409 


TESTI20051200 


1440 


1561. .2145 


3410 


TESTI20051730 


1441 


183.. 1943 


3411 


TESTI20052670 


1442 


759. .1910 ' 


3412 


TESTI20053070 


1443 


98. 1285 


3413 


TESTI20053260 


1444 


456. .842 


3414 


TESTI20053780 


1445 


176. .1948 


3415 


TESTI20053800 


1446 


42.. 1694 


3416 


TESTI20053950 


1447 


528. >1 782 


3417 


TESTI20054700 


1448 


19. .1956 


3418 


TESTI20055680 


1449 


53.. 1510 


3419 


TESTI20055880 


1450 


127.. 1644 


3420 


TESTI20056030 


1451 


543.. 1658 


3421 


TESTI20057200 


1452 


279. .611 


3422 


TESTI20057430 


1453 


131 ..1858 


3423 


TEST120057590 


1454 


37.. 1062 


3424 


TESTI20057840 


1455 


180. .1271 


3425 


TESTI20057880 


1456 


39. .1697 


3426 


TESTI20058350 


1457 


25. .1368 


3427 


TESTI20058920 


1458 


108. .1088 


3428 


TESTI20059080 


1459 


2134. .3069 


3429 


TESTI20059330 


1460 


540.. 845 


3430 


TESTI20059370 


1461 


655. 11 37 


3431 


TESTI20059480 


1462 


52.. 1362 


3432 


TESTI20059790 


1463 


203 1336 


3433 


TESTI20059810 


1464 


324 >1914 


3434 


TESTI20060080 


1465 


172. .858 


3435 1 


TESTI20060150 


1466 


383 1 663 


3436 


TESTI20060350 


1467 


81 ..1427 


3437 


TESTI20060450 


1468 


1306 1752 


3438 


TESTI20060830 


1469 


54.. 2543 


3439 


TESTI20061 090 


1470 


382 7Q8 


3440 


TFSTIP0061 200 


1471 


QQO 1 Q^fi 

\J\JC-.. 1 \7\J\J 


3441 


TFSTI200621 20 

1 CIO 1 l£UUU£ 1 C- KJ 


1472 


37.. 1 632 


3442 


TFSTI200621 80 


1473 


169 141*3 


3443 


TFSTI20062580 


1474 


UwO. . 1 1 UU 


3444 


TESTI20063330 


1475 


1334.. 1678 


3445 


TESTI20063410 


1476 


21 75. .2501 


3446 


TESTI20063600 


1477 


1399.. 1809 


3447 


TESTI20064370 


1478 


201.. 1706 


3448 


TESTI20064530 


1479 


68.. 2737 


3449 


TESTI20064650 


1480 


353.. 2542 


3450 


TESTI20064990 


1481 


733.. 1290 


3451 


TESTI20065650 


1482 


374.. 1609 


3452 


TESTI20066150 


1483 


98. 2473 


3453 


TESTI20066170 


1484 


143. .1309 


3454 


TESTI20066280 


1485 


100. .879 


3455 


TESTI20066330 


1486 


26. 1528 


! 3456 


TESTI20066590 


1487 


306.. 740 


3457 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEO ID NO of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence I 


TEST120066650 


1488 


41 5.. 1362 


3458 


TESTI20067350 


1489 


342 1073 


3459 


TESTI20067440 


1490 


196 1887 


3460 


TESTI20067480 


1491 


127.. 1758 


3461 


TESTI20068530 


1492 


791 1369 


3462 


TESTI20Q68790 


1493 


1033 >1738 


3463 


TESTI20068Q40 


1494 


55 >1867 


3464 


TE t VT!20070400 


1495 


147 2300 


3465 


TF^Ti2no7074o 


1496 


743.. 1480 


3466 


TF C >T120071 "1 30 


1497 


224.. 1 738 


3467 


TF c iTt20071 R30 


1498 


38.. 727 


3468 


TF c Vn200734R0 


1499 


97.. 1584 


3469 


T F *>T 1 20 0 7524 o 


1500 


P5Q 1 gag 


3470 


TF c ^Tipnn7fiR7n 

1 CO 1 l&UU / DO / u 


1501 


622 ..1110 


3471 


TF^TI2007RQ2n 


1502 


QO ROR 


3472 


TF c vnpoo7QORO 


1503 




3473 


TF c n"i2007Q2?o 


1504 


54 2252 


3474 


tf crn?no7QQRO 


1505 


ftp 1300 


3475 


TF c 5T1200R04Rn 


1506 


1 45R 1 7R7 


3476 


tfqtiooori rqo 

1 CO 1 l^LUUO 1 OzfKJ 


1507 


1 R4 25PO 


3477 


TF^TIPOOR^AQO 

1 CO 1 I^UV/0009U 


1508 


11 OP 1 5QR 


3473 


TF C }TI?00R4?50 


1509 


3Rft PO*35 


3479 


tf < ^tipoor5R70 


1510 


145Q P001 


3480 


TF^TI900RfiR40 

1 CO 1 lcwU0004U 


1511 


10.. 1488 


3481 


TF c VTIPOORRR40 

1 CO 1 ICUUOOOHU 


1512 


1 72.. 1587 


3482 


1 CO 1 I^UUOC7^C?U 


1 51 *5 


PR 4P1 


OhOO 


TFcrn?nnQoi ro 


1514 


5P7 1 R57 


3484 


1 CO 1 I^UUtJUiJ / u 


1 51 5 


54R P^OR 


*34R5 


TFQTionnai *3ro 

1 CO 1 I^UUi? 1 ODU 


1 t;i ft 

I 3 I D 


/ . . 1 


"34ftft 
0*400 


TFQTionoQoi 7n 

1 CO 1 I^UUv7^ 1 / u 


lOI/ 


QQO i OCR 


*54R7 


TFQTionoQ*3Qon 


iCHO 

lOlO 




o/oo 
O'fOO 


P CO 1 ItUUS^O^U 


IO la 


1 RQ7 OQ1Q 

i oy / . ,<io i y 


o*H-oy 


1 CO 1 I^UUyO^UU 




Q4 1 1 4Q 

y*+ . . i i **y 


*54Qn 
o*+yu 


t c QT i o n no n 
i co i i*LUuyo^*+u 


1 KOI 


11*54 1/1QR 


o*^y l 


TP QTI9nnQR77n 

I co I liiuuyo/ / u 


1 KQO 
1 3*1*1 


07Q ^OOOf^ 


Qyl QO 

o#y^ 


i co i i^iuuyooou 


1 3*10 


•i A A ^ 77C 


o4yo 


TESTI20097270 


1524 


157. 693 


3494 


TESTI20099350 


1525 


1 23.. > 1773 


3495 


TESTI20 100090 


1526 


1020.. 2057 


3496 


TESTI201 02390 


1527 


1873..2247 


3497 


TESTI201 03690 


1528 


71. .1048 


3498 


TESTI201 04090 


1529 


364..1716 


3499 


TESTI20105130 


1530 


320.. 1966 


3500 


TESTI201 05910 


1531 


151.. 2265 


3501 


TESTI20106170 


1532 


235.. 1338 


3502 


TESTI201 06820 


1533 


1191. .1604 


3503 


TESTI201 07240 


1534 


163. .2844 


3504 


TESTI201 07320 


1535 


377.. 787 


3505 


TESTI201 07340 


1536 


1229.. 1723 


3506 
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Table 1 (continued) 
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Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


TESTI201 08060 


1537 


371.. 925 


3507 


TESTI201 12540 


1538 


148. .1554 


3508 


TESTI201 12860 


1539 


285.. 1829 


3509 


TESTI20113150 


1540 


374.. 1042 


3510 


TESTI201 13940 


1541 


1185.. 1598 


3511 


TESTI201 14480 


1542 


161.. 2032 


3512 


TESTI201 16050 


1543 


218. .1756 


3513 I 


TESTI20116120 


1544 


17. .1276 


3514 


TESTI201 17500 


1545 


140.. 508 


3515 


TESTI2011 8460 


1546 


69.. 1580 


3516 


TESTI20 120500 


1547 


135. 926 


3517 


TESTI201 20900 


1548 


60.. 665 


3518 


TESTI20121040 


1549 


8.. 1525 


3519 


TESTI20121710 


1550 


278.. 1543 


3520 


TESTI20 122070 


1551 


145.. 1359 


3521 


TESTI20 122440 


1552 


25.. 465 


3522 


TESTI20 124440 


1553 


469. .801 


3523 


TEST120 125280 


1554 


371 ..1363 


3524 


TEST120 125440 


1555 


342. .671 


3525 


TESTI20 125920 


1556 


31 ..1680 


3526 


TESTI20 126280 


1557 


169.. 2562 


3527 


TESTI201 30530 


1558 


298.. 1692 


3528 


TSSTI20131440 


1559 


223.. 1278 


3529 


TESTI201 32310 


1560 


204.. 71 3 


3530 


TESTI201 32680 


1561 


1011 ..1448 


3531 


TESTI201 34010 


1562 


334.. 1464 


3532 


TESTI20 134270 


1563 


59. .601 


3533 


TESTI201 34680 


1564 


324.. 2726 


3534 


TESTI20 134970 


1565 


197.. 1873 


3535 


TESTI201 36010 


1566 


152 >1415 


3536 


TESTI20 140970 


1567 


376. .801 


3537 


TESTI201 42480 


1568 


1466. .1939 


3538 


TESTI201 42540 


1569 


1231 1593 


3539 


TESTI20143180 


1570 


201 1946 


3540 


TESTI20 144390 


1571 


776 1 552 


3541 


TESTI20 1 45780 


1572 


1315 1746 


3542 


TESTI20 148380 


1573 


32.. 2680 


3543 


TESTI201 49880 


1574 


581.. 982 


3544 


TESTI20 150420 


1575 


221. .1162 


3545 


TESTI201 50920 


1576 


142. .1713 


3546 


TESTI20151050 


1577 


1290.. 1736 


3547 


TESTI20151800 


1578 


671.. 1099 


3548 


TESTI20 152490 


1579 


717.. 1226 


3549 


TESTI20 153310 


1580 


151. .1317 


3550 


TESTI20 154370 


1581 


192.. 1544 


3551 


TESTI201 59380 


1582 


852.. 1679 


3552 


TESTI20161010 


1583 


33..S39 


3553 


TESTI20 162780 


1584 


607.. 1299 


3554 


TESTI20 162980 


1585 


224..1414 


3555 
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Clone name 


SEO ID NO of nucleotide 


Position of CDS 


CPO ID NO of amino arid 




sequence 




sequence 


TESTI201 64210 


1586 


526.. 1593 


3556 


TESTI201 65680 


1587 


154.. 552 


3557 


TESTI201 65990 


1588 


201 ..992 


3558 


TESTI201 66290 


1589 


351 2786 

WW 1 . . L— § KJ W 


3559 


TESTI201 66670 


1590 


754 1938 

/ W^T. . | WWW 


3560 


TESTI201 67580 


1591 


1152 >1543 


3561 


TESTI201 68880 


1592 


201 ..524 


3562 


TFSTI201 6Q600 


1593 


P8H 1 674 


^F?R^ 
ODDO 


TESTI20170170 


1594 


1 go 1 530 

f W W ■ » 1 H_J w w 


3564 


TESTI201 70280 


1595 


1 ..696 


3565 


TESTI201 70690 


1596 


61 ..474 


3566 


TFSTIP01 70890 

1 1 — ^-J 1 l£LU 1 / UCJC7U 


1597 


889 1 P^8 

Out . . 1 cOO 


3567 


TFSTI90171070 


1598 


1088 1601 

1 UOO, . 1 >J\J t 


^RR8 


TFSTIP01 7*3060 

1 I O 1 1 £- KJ 1 I \j\J\J\J 


1 599 


724.. 1218 




1 CO 1 1 C- vJ 1 / O 1 IU 


1 600 


11Q4 1686 


O vJ / u 


TP^TIPni 7°,Q60 


1 601 

1 OU 1 


O/D.. 1 / OH- 


^^71 

OO / I 


TF^TIPOI 76*370 


1 609 


0<lO. . 1 / 


oo / ^ 


TFSTIP01 764^0 

1 CO 1 1 1 / U*+«JU 


1 603 


1 QQ6 


^^7^1 
OO / o 


TF^TIPm 7Q9°.0 
1 L_ O 1 1 ^_VJ 1 / y^iou 


1 604 
1 Out 


1 RP 1 7HQ 




TF^TIPm 7Q61 n 


1 606 

1 DUO 


1 709 s9n49 


OC7C 
Oj / O 


TFSTI201 80600 

1 CO 1 \£L\J 1 OUUUU 


1 606 


16P 867 


^67R 
OO / o 


TF^TIPOI 8991 o 

1 CO 1 l^lU 1 O^-C. 1 KJ 


1 607 


7^6 1 1 ao 


OO / / 


TF^TIPm 89760 

1 CO 1 1 C- KJ 1 O/L. / uu 


1 608 

1 DUO 


ORR 1 Qnn 


OC7R 

OO / D 


1 CO 1 \C\J 1 OODOU 


1 60Q 


778 s1Q97 


OO / 17 


TF^TI901 84980 


1610 


OD..> J / 


ooou 


TF9TIP01 84760 

I CO I I £—KJ I Ot/ JU 


1611 
1 D I I 


QQQ OUR 


OOD I 


TF^TIPm 84760 

1 CO I 1 1 OH / OU 


1619 

ID l£ 


161 1741 
1 D 1 . . 1 / H 1 




TF^TIPm 84890 

1 CO 1 1 C.KJ 1 OHOtU 


1 613 


1 8Q 799 


oooo 


TF<^TI901 861 1 n 

1 CO 1 \£\J 1 DO 1 1 KJ 


1 R1 4 

10 14 


ooi 1 ROR 
C.C. 1 . . 1 DUO 


OOOH 


TF<^TI9ni Q9670 

I CO 1 l<CU 1 VCJZjl U 


1 R1 R 

ID IO 


1 flQ RQ7 

I uy..Do/ 


qcpc 
OOOO 


i co i i i youou 


1 R1 R 
ID ID 


14Q 91 QA 


oooo 


TF^TIPfil 0^690 

1 CO 1 \C-\J 1 c?OD£U 


ID 1 / 


OH 1 Q7R 
Oil.. IO / J 


OCR7 

ooo/ 


TF^TI901 Q4880 

1 CO f \t—KJ \ iJHOOU 


ID IO 


R 1 R 1 ^RA 
DID..1 004 


oooo 


i co \ \ yooyu 


1 R1 Q 

id iy 


ii/ t ..y4o 


3589 


TFQTI901 QRQ70 


1 ROft 


/oy..lODO 


QCQrt 

ooyu 


I CO I \d.K) \ if / uou 


1 O*! 1 


Ofto i *7H ~7 
^D0..1 / 1 / 


ooy i 


TESTI20 197290 


1622 


643.. 1314 


3592 


TESTI201 97600 


1623 


100.. 909 


3593 


TESTI201 98540 


1624 


21. .1553 


3594 


TESTI201 98600 


1625 


909.. 1328 


3595 


TESTI20199110 


1626 


622..1617 


3596 


TESTI20 199980 


1627 


764.. 11 02 


3597 


TESTI20200120 


1628 


661. .966 


3598 


TESTI20200840 


1629 


378.. 81 8 


3599 


TESTI20201760 


1630 


22.. 603 


3600 


TESTI20202830 


1631 


631.. 972 


3601 


TESTI20204260 


1632 


714.. 1406 


3602 


TESTI20205100 


1633 


270.. 1763 


3603 


TESTI20205150 


1634 


1286.. 1663 


3604 
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Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


TESTI20205250 


1635 


32.. 1888 


3605 


TESTI20207170 


1636 


1.795 


3606 


TESTI20209050 


1637 


269. .1900 


3607 


TESTI20210030 


1638 


328. .675 


3608 


TESTI20210570 


1639 


280.. 1362 


3609 


TESTI20211380 


1640 


372.. 848 


3610 


TESTI20212970 


1641 


93. .21 02 


3611 


TESTI20214630 


1642 


690. .1106 


3612 


TESTI20215310 


1643 


294.. 899 


3613 


TESTI20219110 


1644 


982.. 1344 


3614 


TESTI20219390 


1645 


253.. 573 


3615 


TESTI20220230 


1646 


11 6. .967 


3616 


TESTI 20221 790 


1647 


109.. 1830 


3617 


TESTI20222030 


1648 


277.. 2277 


3618 


TESTI20222460 


1649 


162.. 1889 


3619 


TESTI20223380 


1650 


322.. 831 


3620 


TESTI20226520 


1651 


178.. 1392 


3621 


TESTI20227380 


1652 


20.. 1747 


3622 


TESTI20228120 


1653 


31 ..858 


3623 


TESTI20228740 


1654 


607 1026 


3624 


TESTI 20244220 


1655 


757.. 2448 


3625 


TESTI20244430 


1656 


1 7.. 1444 


3626 


TESTI20244460 


1657 


684. .1511 


3627 


TESTI20244730 


1658 


127.. 1044 


3628 


TESTI20245600 


1659 


255 1523 


3629 


TESTI20245860 


1660 


684 1058 


3630 


TESTI2024641 0 


1661 


1513 1818 


3631 


TESTI20246480 


1662 


380 1399 


3632 


TESTI 20247440 


1663 


425.. 907 


3633 


TESTI20248850 


1664 


3..1 829 


3634 


TESTI20249360 


1665 


600 1 565 


3635 


TESTI20250220 


1666 




3636 


TESTI20250630 


1667 


48? 1 690 


3637 


TFSTI20251440 


1 668 


42.. 1 271 


3638 


TFST120251 61 0 


1669 


HOU . . 1 £ I O 


3639 


1 1 — 1 1 £— \J 1 / 4U 


1 670 


/ OU ..II 


3640 


TESTI20252690 


1671 


1182..2138 


3641 


TESTI20254030 


1672 


363.. 1958 


3642 


TESTI20254090 


1673 


878.. 1360 


3643 


TESTI20254480 


1674 


320.. 1390 


3644 


TESTI20254990 


1675 


693.. 1799 


3645 


TESTI20255460 


1676 


152. .2710 


3646 


TESTI20256560 


1677 


169..>2257 


3647 


TESTI20257910 


1678 


457.. 1311 


3648 


TESTI20258720 


1679 


38.. 1630 


3649 


TESTI20259110 


1680 


486.. 1805 


3650 


TESTI20259200 


1681 


227.. 1369 


3651 


TESTI20260140 


1682 


1113. .1421 


3652 


TESTI20260640 


1683 


6.. 1109 


3653 
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Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


TESTI20261040 


1684 


14. .970 


3654 


TESTI20261160 


1685 


257.. 868 


3655 


TESTI20261680 


1686 


1422. .1901 


3656 


TESTI20262150 


1687 


315. .2765 


3657 


TESTI20262940 


1688 


255.. 1574 


3658 


TESTI20264530 


1689 


150. .1580 


3659 


TESTI20264910 


1690 


1638.. 2378 


3660 


TESTI202651 50 


1691 


1023 1409 


3661 


TESTI20265340 


1692 


965.. 1552 


3662 


TESTI20265890 


1693 


994.. 1308 


3663 


TESTI20266050 


1694 


116. .1522 


3664 


TESTI20268240 


1695 


70.. 1638 


3665 


TESTI20269250 


1696 


17..>1926 


3666 


TESTI20269360 


1 697 


682 1 005 


3667 


TESTI202701 30 


1 698 


99. .512 


3668 


TESTI20271 790 


1699 


101 ..1822 


3669 


TESTI20272380 


1 700 


121 ..492 


3670 


TESTI20274960 


1701 


619 1716 


3671 


TESTI20277300 


1 702 


263.. 1 600 


3672 


TEST120278280 


1703 


80. .607 


3673 


TESTI20282420 


1704 


52. .528 


3674 


TESTI20282530 


1705 


699 1 985 


3675 


TESTI20282900 


1 706 


30. .341 


3676 


TESTI20284260 


1707 


29.. 493 


3677 


TESTI20285230 


1708 


1710 2366 


3678 


TESTI20286590 


1709 


519. .1121 


3679 


TESTI20287760 


1710 


775 26^4 


3680 


THYMU 1 0004280 


1711 


617 988 


3681 


THYMU20006020 

■ III IVIUcUUUUUcU 


1712 


608 1?R7 

UUO . . 1 JO / 


3682 


THYMU20007020 


1713 


17*^ 2111 


OUOv) 


THYMU20007750 


1714 


1077 14QQ 


3684 


THYMU20008000 


1715 


1506 19^ 


3685 


THYMU2000Q460 

till IVIUCUUUQtUU 


1716 


1 *V34 1 72*3 


3686 


THYMUPOOOQ^OO 


1717 


717 9117 


ODD / 


THYMl IP000Q71 0 


1718 


. . / O^l 


ODOO 


THYMl J2001 01 80 


1 71 9 


1Q1 


oooy 


THYMU20010710 


1720 


850. .1164 


3690 


THYMU20012020 


1721 


1160.. 1561 


3691 


THYMU20012560 


1722 


47..517 


3692 


THYMU20013250 


1723 


131. .1309 


3693 


THYMU20013810 


1724 


196. .927 


3694 


THYMU20014430 


1725 


1280.. 1678 


3695 


THYMU20017270 


1726 


16.. 660 


3696 


THYMU20018250 


1727 


1205.. 21 22 


3697 


THYMU20018390 


1728 


237.. 1373 


3698 


THYMU20019000 


1729 


405.. 758 


3699 


THYMU20019260 


1730 


317..736 


3700 


THYMU20020370 


1731 


1332.. 1733 


3701 


THYMU20020800 


1732 - 


600.. 1196 


3702 
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Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


THYMU20021090 


1733 


211. .1281 


3703 


THYMU20021540 


1734 


100. .1578 


3704 


THYMU20023560 


1735 


1429.. 1884 


3705 


THYMU20024500 


1736 


199. .1584 


3706 


THYMU20025480 


1737 


1362. .1751 


3707 


THYMU20026950 


1738 


9. .920 


3708 


THYMU20028150 


1739 


233. 1222 


3709 


THYMU20028410 


1740 


10.. 1086 


3710 


THYMU20029830 


1741 


46.. 456 


3711 


THYMU20030460 


1742 


453..1175 


3712 


THYMU20030690 


1743 


1658. .2269 


3713 


THYMU20031330 


1744 


90. .569 


3714 


THYMU20032820 


1745 


250..1176 


3715 


THYMU20034400 


1746 


791 .1123 


3716 


THYMU20034790 


1747 


120. 452 


3717 


THYMU20036500 


1748 


56. .1657 


3718 


THYMU20039320 


1749 


584.. 1339 


3719 


THYMU20043440 


1750 


1301 .1615 


3720 


THYMU20043560 


1751 


292.. 621 


3721 


THYMU20044100 


1752 


1 800.. 2228 


3722 


THYMU20044520 


1753 


1387.. 1899 


3723 


THYMU20046350 


1754 


1863. .2351 


3724 


THYMU20046770 


1755 


321 ..671 


3725 


THYMU20049060 


1756 


172.. 732 


3726 


THYMU2005001 0 


1757 


206. .1681 


3727 


THYMU20051340 


1758 


886. .1209 


3728 


THYMU20052460 


1759 


1300. .1 683 


3729 


THYMU20052830 


1760 


92.. 1504 


3730 


THYMU20054800 


1761 


479.. 880 


3731 


THYMU20055450 


1762 


11 2.. 426 


3732 


THYMU20055460 

till IVIUt.UV'JUT^Uv 


1763 


20. .361 


3733 


THYMU20055740 


1764 


901. .1341 


3734 


THYMU20055760 


1765 


232.. 738 


3735 


THYMU20058550 


1766 


1170 1493 

1 1 />»/»» 1 ^T^W 


3736 


THYMU20060480 


1767 


2046 >2453 


3737 


THYMU20062520 

till Iwl w t.UUUt.<Jt.U 


1768 


299 655 


3733 


THYMU20062610 


1769 


136.. 1953 


3739 


THYMU20062770 


1770 


381. .1199 


3740 


THYMU20063650 


1771 


264.. 800 


3741 


THYMU20064680 


1772 


253.. 621 


3742 


THYMU20066660 


1773 


405.1487 


3743 


THYMU20069130 


1774 


341. .961 


3744 


THYMU20069460 


1775 


231. .575 


3745 


THYMU20069650 


1776 


728.. 1036 


3746 


THYMU20070250 


1777 


81. .1703 


3747 


THYMU20071120 


1778 


170.. 1792 


3748 


THYMU20071460 


1779 


152.. 499 


3749 


THYMU20072580 


1780 


1981.. 2286 


3750 


THYMU20073070 


1781 


551. .1225 


3751 
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Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


THYMU20073080 


1782 


755.. 1198 


3752 


THYMU20077250 


1783 


645.. 1241 


3753 


THYMU20078020 


1784 


1290. .1598 


3754 


THYMU20078240 


1785 


1506.. 1838 


3755 


THYMU20079690 


1786 


1049.. 1714 


3756 


THYMU20080490 


1787 


57..398 


3757 


THYMU20081110 


1788 


629.. 1246 


3758 


THYMU20083390 


1789 


592.. 924 


3759 


THYMU20083500 


1790 


744.. 1067 


3760 


THYMU20083830 


1791 


155.. 1786 


3761 


THYMU20084520 


1792 


286. 597 


3762 


THYMU20086430 


1793 


769.. 1077 


3763 


THYMU20087270 


1794 


842.. 2008 


3764 


THYMU20089170 


1795 


305. .619 


3765 


THYMU20089900 


1796 


1008. .1421 


3766 


THYMU20090230 


1797 


106.. 594 


3767 


THYMU20091040 


1798 


728.. 1054 


3768 


THYMU20095920 


1799 


1809.. 2378 


3769 


THYMU20096580 


1800 


120. 692 


3770 


THYMU20097920 


1801 


95.. 2086 


3771 


THYMU20098350 


1802 


99. .1802 


3772 


THYMU20099060 


1803 


80. .1573 


3773 


THYMU201 00940 


1804 


8. 221 8 


3774 


THYMU201 04480 


1805 


39. .1451 


3775 


THYMU201 06990 


1806 


673.. 1464 


3776 


THYMU201 10720 


1807 


372.. 1070 


3777 


THYMU201 12570 


1808 


11 7.. 455 


3778 


THYMU201 12590 


1809 


112. .1827 


3779 


THYMU201 15380 


1810 


1507.. 1839 


3780 


THYMU201 15730 


1811 


943.. 1632 


3781 


THYMU201 17850 


1812 


1625.. 1948 


3782 


THYMU201 20240 


1813 


1239. .3002 


3783 


THYMU201 20730 


1814 


1257. .2459 


3784 


THYMU20121040 


1815 


230.. 2323 


3785 


THYMU201 28910 


1816 


181 ..1185 


3786 


THYMU201 29020 


1817 


1545..>2019 


3787 


THYMU201 30470 


1818 


87.. 1169 


3788 


THYMU201 34260 


1819 


1845..2171 


3789 


THYMU201 37050 


1820 


958. .1341 


3790 


THYMU201 37570 


1821 


21 13. .2571 


3791 


THYMU20139160 


1822 


69.. 686 


3792 


THYMU201 40510 


1823 


1312.. 1626 


3793 


THYMU201 43230 


1824 


575.. 2023 


3794 


THYMU201 45990 


1825 


1330.. 2262 


3795 


THYMU20148010 


1826 


326.. 820 


3796 


THYMU20 149230 


1827 


581.. 925 


3797 


THYMU20150190 


1828 


1516. .1908 


3798 


THYMU20151610 


1829 


218.. 1663 


3799 


THYMU201 53210 


1830 


1135..2340 


3800 
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SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


THYMU201 54790 


1831 


1220.. 1753 


3801 


THYMU201 57620 


1832 


771.. 1487 


3802 


THYMU201 63600 


1833 


1216.. 1650 


3803 


THYMU201 70080 


1834 


944.. 1399 


3804 


THYMU201 70230 


1835 


835.. 1941 


3805 


THYMU20171580 


1836 


821. .1150 


3806 


THYMU201 74490 


1837 


52.. 453 


3807 


THYMU201 74790 


1838 


29.. 370 


3808 


THYMU201 75260 


1839 


991. .1452 


3809 


THYMU201 76010 


1840 


12.. 2039 


3810 


THYMU201 77070 


1841 


1059.. 1415 


3811 


THYMU201 78440 


1842 


81. .1517 


3812 


THYMU20181890 


1843 


102.. 1652 


3813 


THYMU201 84550 


1844 


171.. 1886 


3814 


THYMU201 85470 


1845 


161. .817 


3815 


THYMU201 85650 


1846 


36. .536 


3816 


THYMU201 87210 


1847 


190..507 


3817 


THYMU20191970 


1848 


230.. 1597 


3818 


TKIDN1 0000620 


1849 


1075.. 1419 


3819 


TKIDN10001710 


1850 


1685.. 2365 


3820 


TKIDN10001920 


1851 


98.. 793 


3821 


TRACH20011010 


1852 


228. 2033 


3822 


TRACH20011540 


1853 


125. .478 


3823 


TRACH20012490 


1854 


83.. 1600 


3824 


TRACH20021000 


1855 


234.. 854 


3825 


TRACH20021380 


1856 


313.. 1890 


3826 


TRACH20025370 


1857 


947.. 1456 


3827 


TRACH20026640 


1858 


964.. 1557 


3828 


TRACH20029880 


1859 


201.. 1607 


3829 


TRACH20040390 


1860 


125. .331 9 


3830 


TRACH20041090 


1861 


1254.. 1589 


3831 


TRACH20043360 


1862 


30.. 2066 


3832 


TRACH20044990 


1863 


209.. 790 


3833 


TRACH20049500 


1864 


840..1166 


3834 


TRACH20051 590 


1865 


927.. 1247 


3835 


TRACH20057200 


1866 


2082.. 2393 


3836 


TRACH20058000 


1867 


187.. 1647 


3837 


TRACH20073990 


1868 


98.. 454 


3838 


TRACH20080810 


1869 


2073..>2910 


3839 


TRACH20081270 


1870 


1681..>2025 


3840 


TRACH20090060 


1871 


309.. 1781 


3841 


TRACH20091070 


1872 


129.. 1841 


3842 


TRACH20093400 


1873 


131.. 2251 


3843 


TRACH20093480 


1874 


694.. 1092 


3844 


TRACH20098510 


1875 


142. .1944 


3845 


TRACH20101590 


1876 


1660.. 2220 


3846 


TRACH201 04510 


1877 


144.. 1184 


3847 


TRACH201 08240 


1878 


36..791 


3848 


TRACH201 13020 


1879 


177.. 1352 


3849 
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SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


TRACH201 22980 


1880 


732.. 1562 


3850 


TRACH201 23870 


1881 


235.. 540 


3851 


TRACH201 24970 


1882 


109..486 


3852 


TRACH201 25620 


1883 


22. 387 


3853 


TRACH20129180 


1884 


1545. .1898 


3854 


TRACH20131230 


1885 


123.. 2939 


3855 


TRACH201 39280 


1886 


152. .1474 


3856 


TRACH20140180 


1887 


203.. 553 


3857 


TRACH201 43710 


1888 


1219.. 1998 


3858 


TRACH201 49500 


1889 


163.. 906 


3859 


TRACH201 49720 


1890 


187.1695 


3860 


TRACH201 49740 


1891 


864 .1505 


3861 


TRACH201 58240 


1892 


254.. 796 


3862 


TRACH201 59390 


1893 


1618.. 1941 


3863 


TRACH2016 


1894 


1223. .1531 


3864 


TRACH20 163470 


1895 


971.. 2356 


3865 


TRACH20164100 


1896 


372. .722 


3866 


TRACH201 64810 


1897 


1685. .21 04 


3867 


TRACH201 65330 


1898 


704.1 102 


3868 


TRACH201 65540 


1899 


810.. 1514 


3869 


TRACH201 67090 


1900 


57.. 1148 


3870 


TRACH201 70860 


1901 


186. 1625 


3871 


TRACH201 73680 


1902 


83.. 1597 


3872 


TRACH201 74980 


1903 


93.. 1088 


3873 


TRACH201 82780 


1904 


1669.. 2070 


3874 


TRACH2018512 


1905 


13.. 339 


3875 


TRACH201 88350 


1906 


248.. 1015 


3876 


TRACH201 90460 


1907 


351. .1526 


3877 


UMVEN 10001 380 


1908 


21 22.. 2436 


3878 


UTERU10001060 


1909 


187.. 552 


3879 


UTERU1 0001 870 


1910 


1681. .2031 


3880 


UTERU20000230 


1911 


120..608 


3881 


UTERU20000950 


1912 


240.. 1658 


3882 


UTERU20011760 


1913 


171 9.. 2090 


3883 


UTERU20013890 


1914 


1544.. 1951 


3884 


UTERU20016580 


1915 


25. .1287 


3885 


UTERU20026620 


1916 


1020.. 1556 


3886 


UTERU20027360 


1917 


1624.. 1992 


3887 


UTERU20029930 


1918 


82.. 1902 


3888 


UTERU20031350 


1919 


35.. 520 


3889 


UTERU20035770 


1920 


920.. 1282 


3890 


UTERU20040150 


1921 


917.. 1717 


3891 


UTERU20040370 


1922 


22.. 1866 


3892 


UTERU20040390 


1923 


727. 1200 


3893 


UTERU20040730 


1924 


1009..>2303 


3894 


UTERU20041630 


1925 


105. 1328 


3895 


UTERU20041970 


1926 


26.. 484 


3896 


UTERU20045200 


1927 


6..917 


3897 


UTERU20051790 


1928 


415.. 1341 


3898 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 
sequence 


Position of CDS 


SEQ ID NO. of amino acid 
sequence 


UTERU20064120 


1929 


147.10~61 


3899 


UTERU20065470 


1930 


389. .694 


3900 


UTERU20079240 


1931 


408.. 71 3 


3901 


UTERU20083020 


1932 


23.. 826 


3902 


UTERU20086530 


1933 


1274.. 1783 


3903 


UTERU20087070 


1934 


1274.. 2467 


3904 


UTERU20087850 


1935 


464.. 991 


3905 


UTERU20089300 


1936 


906 1253 


3906 


UTERU20089390 


1937 


11 ..1729 


3907 


UTERU20089620 


1938 


132.. 725 


3908 


UTERU20090940 


1939 


122. .631 


3909 


UTERU20091 470 


1940 


260. .697 


3910 


UTERU20094830 


1941 


127.. 1578 


3911 


UTERU200951 00 


1942 


1593. .1913 


3912 


UTERU20099040 


1943 


59. .883 


3913 


UTERU2009951 0 


1944 


705 2375 


3914 


UTERU201 01150 


1945 


874 1188 


3915 


UTERU201 02260 


1946 


270. .671 


3916 


UTERU201 03040 


1947 


11 64.. 1772 


3917 


UTERU201 03200 


1948 


374.. 709 


3918 


UTERU201 04310 


1 949 


90. .425 


3919 


UTERU201 0651 0 


1 950 


1649 2275 


3920 


UTERU201 21 1 40 


1 951 


880 1644 


3921 


UTERU201 22520 


1 952 


572 >2146 


3922 


UTFRU201 25810 


1953 


21 ..329 


3923 


UTERU201 27030 


1 954 


618 1763 


3924 


UTERU20127150 


1955 


707 1699 


3925 


UTE RU20 1 28560 


1956 


203.. 721 


3926 


UTERU201 32620 


1957 


330 1007 


3927 


UTERU201 34830 


1958 


121. .1341 


3928 


UTERU201 39760 


1959 


982.. 1659 


3929 


UTERU201 40010 


1960 


42. .368 


3930 


UTERU201 67570 


1961 


57.. 710 


3931 


UTERU201 68960 


1962 


317.. 1468 


3932 


UTE RU20 169020 


1963 


30.. 782 


3933 


UTERU201 73030 


1964 


1985..>2355 


3934 


UTERU201 76230 


1965 


1004.. 1372 


3935 


UTERU201 77150 


1966 


3. 350 


3936 


UTERU20181270 


1967 


153. .620 


3937 


UTERU201 85220 


1968 


521.. 1357 


3938 


UTERU201 88670 


1969 


231. 1886 


3939 


UTE RU20 188840 


1970 


1834..2181 


3940 



[0028] Namely, primers used to synthesize polynucleotides can be designed based on the nucleotide sequences of 
polynucleotides of the present invention shown in SEQ ID NOs in the above Table 1 . When one intends to synthesize 
full-length cDNAs, an oligo dT primer can be used as the 3'-end primer. The length of the primers is usually 15-100 
bp, and favorably between 15-35 bp. In case of LA PCR, which is described below, the primer length of 25-35 bp may 
provide a good result. 

[0029] A method to design a primer that enables a specific amplification based on the aimed nucleotide sequence 
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is known to those skilled in the art (Current Protocols in Molecular Biology, Ausubel et al. edit, (1987) John Wiley & 
Sons, Section 6.1-6.4). In designing a primer based on the 5'-end sequence, the primer is designed so as that in 
principle, the amplification products will include the translation start site. Accordingly, for example, when the 5'-end 
primer is designed based on the nucleotide sequence of 5* untranslated region (5'UTR), any part of the 5'-end, which 

5 ensures the specificity to the cDNA of interest, can be selected as the primer. 

[0030] When synthesizing a full-length cDNA, the target nucleotide sequence to be amplified can extend to several 
thousand bp in some cDNA. However, it is possible to amplify such a long nucleotides by using such as LA PCR (Long 
and Accurate PCR). It is advantageous to use LA PCR when synthesizing long DNA. In LA PCR, in which a special 
DNA polymerase having 3' -> 5' exonuclease activity is used, misincorporated nucleotides can be removed. Accordingly, 

10 accurate synthesis of the complementary strand can be achieved even with a long nucleotide sequence. By using LA 
PCR, it is reported that amplification of a nucleotide with 20 kb longer can be achieved under desirable conditions 
(Takeshi Hayashi (1996) Jikken-lgaku Bessatsu, "Advanced Technologies in PCR" Youdo-sha). 
[0031 ] A template DNA for synthesizing the full-length cDNA of the present invention can be obtained by using cDNA 
libraries that are prepared by various methods. The full-length cDNA clones of the present invention are clones with 

15 high probability of completeness in length, which were obtained by the method comprising the steps of [1] preparing 
libraries containing cDNAs with the very high fullness ratio by oligo-capping, and [2] assembling the 5'-end sequences 
and selecting one with the highest probability of completeness in length in the cluster formed (there are many clones 
longer in the 5'-end direction). 

[0032] However, the uses of primers designed based on the full-length nucleotide sequences provided by the present 

20 invention enable easily obtaining full-length cDNAs without such a special technique. 

[0033] The problem with the cDNA libraries prepared by the known methods or commercially available is that mRNA 
contained in the libraries has very low fullness ratio. Thus, it is difficult to screen full-length cDNA clone directly from 
the library using ordinary cloning methods. The present invention has revealed a nucleotide sequence of novel full- 
length cDNA. If a full-length nucleotide sequence is provided, it is possible to synthesize a target full-length cDNA by 

25 using enzymatic reactions such as PCR. In particular, a full-length-enriched cDNA library, synthesized by methods 
such as oligo-capping, is desirable to synthesize a full-length cDNA with more reliability. 

[0034] The 5'-end sequence of the full-length cDNA clones of the invention can be used to isolate the regulatory 
element of transcription including the promoter on the genome. A rough draft of the human genome (analysis of human 
genomic sequence with lower accuracy), which covers 90% of the genome, has been reported (Nature, Vol.409, 

30 814-823, 2001), and by the year 2003, analysis of the entire human genomic sequence is going to be finished. However, 
it is hard to analyze with software the transcription start sites on the human genome, in which long introns exist. By 
contrast, it is easy to specify the transcription start site on the genomic sequence using the nucleotide sequence which 
includes the 5'-end of the full-length cDNA clone of the present invention, and thus it is easy to obtain the genomic 
region involved in transcription regulation, which includes the promoter that is contained in the upstream of the tran- 

35 scription start site. 

[0035] The polypeptide encoded by the full-length cDNA of the invention can be prepared as a recombinant polypep- 
tide or as a natural polypeptide. For example, the recombinant polypeptide can be prepared by inserting the polynu- 
cleotide encoding the polypeptide of the invention into a vector, introducing the vector into an appropriate host cell and 
purifying the polypeptide expressed within the transformed host cell, as described below. In contrast, the natural 

40 polypeptide can be prepared, for example, by utilizing an affinity column to which an antibody against the polypeptide 
of the invention (Current Protocols in Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 
16.1-16.1 9) is attached. The antibody used for affinity purification may be either a polyclonal antibody, or a monoclonal 
antibody. Alternatively, in vitro translation (See, for example, "On the fidelity of mRNA translation in the nuclease-treated 
rabbit reticulocyte lysate system." Dasso M.C., and Jackson R.J. (1989) Nucleic Acids Res. 17: 3129-3144) may be 

45 used for preparing the polypeptide of the invention. 

[0036] Polypeptides functionally equivalent to the polypeptides of the present invention can be prepared based on 
the activities, which were clarified in the above-mentioned manner, of the polypeptides of the present invention. Using 
the biological activity possessed by the polypeptide of the invention as an index, it is possible to verify whether or not 
a particular polypeptide is functionally equivalent to the polypeptide of the invention by examining whether or not the 

50 polypeptide has said activity. 

[0037] Polypeptides functionally equivalent to the polypeptides of the present invention can be prepared by those 
skilled in the art, for example, by using a method for introducing mutations into an amino acid sequence of a polypeptide 
(for example, site-directed mutagenesis (Current Protocols in Molecular Biology, edit, Ausubel et al. , (1 987) John Wiley 
& Sons, Section 8.1-8.5). Besides, such polypeptides can be generated by spontaneous mutations. The present in- 

55 vention also includes a polypeptide comprising the amino acid sequence shown in Table 1 in which one or more amino 
acids are substituted, deleted, inserted, and/or added, as long as the polypeptides have the equivalent functions to 
those of the polypeptides identified in the present Examples described later. 

[0038] There are no limitations on the number and sites of amino acid mutations, as long as the polypeptides maintain 
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the functions thereof. The number of mutations typically corresponds to 30% or less, or 20% or less, or 1 0% or less, 
preferably 5% or less, or 3% or less of the total amino acids, more preferably 2% or less or 1% or less of the total 
amino acids. Alternatively, herein, substitution of one or more amino acids includes substitution of several amino acids. 
As used herein, the term "several amino acids" means, for example, 5 amino acids, preferably 4 or 3 amino acids, 

5 more preferably 2 amino acids, and further preferably 1 amino acid. 

[0039] From the viewpoint of maintaining the polypeptide function, it is preferable that a substituted amino acid has 
a similar property to that of the original amino acid. For example, Ala, Val, Leu, lie, Pro, Met, Phe and Trp are assumed 
to have similar properties to one another because they are all classified into a group of non-polar amino acids. Similarly, 
substitution can be performed among non-charged amino acid such as Gly, Ser, Thr, Cys, Tyr, Asn, and Gin, acidic 

w amino acids such as Asp and Glu, and basic amino acids such as Lys, Arg, and His. 

[0040] In addition, polypeptides functionally equivalent to the polypeptides of the present invention can be isolated 
by using techniques of hybridization or gene amplification known to those skilled in the art. Specifically, using the 
hybridization technique (Current Protocols in Molecular Biology, edit, Ausubel et al., (1987) John Wiley & Sons, Section 
6.3-6.4)), those skilled in the art can usually isolate a polynucleotide highly homologous to the polynucleotide encoding 

15 the polypeptide identified in the present Example based on the identified nucleotide sequence (Table 1) or a portion 
thereof and obtain the functionally equivalent polypeptide from the isolated polynucleotide. The present invention in- 
clude polypeptides encoded by the polynucleotides hybridizing with the polynucleotides encoding the polypeptides 
identified in the present Example, as long as the polypeptides are functionally equivalent to the polypeptides identified 
in the present Example. Organisms from which the functionally equivalent polypeptides are isolated are illustrated by 

20 vertebrates such as human, mouse, rat, rabbit, pig and bovine, but are not limited to these animals. 

[0041] Washing conditions of hybridization for the isolation of polynucleotides encoding the functionally equivalent 
polypeptides are usually "1x SSC, 0.1% SDS, 37°C"; more stringent conditions are "0.5x SSC, 0.1% SDS, 42°C"; and 
still more stringent conditions are "O.lxSSC, 0.1% SDS, 65°C". Alternatively, the following conditions can be given as 
hybridization conditions of the present invention. Namely, conditions in which the hybridization is done at "6x SSC, 

25 40% Formamide, 25°C", and the washing at "1x SSC, 55°C" can be given. More preferable conditions are those in 
which the hybridization is done at "6x SSC, 40% Formamide, 37°C", and the washing at "0.2x SSC, 55°C". Even more 
preferable are those in which the hybridization is done at "6x SSC, 50% Formamide, 37°C'\ and the washing at "0.1 x 
SSC, 62°C". The more stringent the conditions of hybridization are, the more frequently the polynucleotides highly 
homologous to the probe sequence are isolated. Therefore, it is preferable to conduct hybridization under stringent 

30 conditions. Examples of stringent conditions in the present invention are, washing conditions of "0.5x SSC, 0.1% SDS, 
42°C", or alternatively, hybridization conditions of "6x SSC, 40% Formamide, 37°C", and the washing at "0.2x SSC, 
55°C'\ 

[0042] One skilled in the art can suitably select various conditions, such as dilution ratios of SSC, formamide con- 
centrations, and temperatures to accomplish a similar stringency. 
35 [0043] However, the above-mentioned combinations of SSC, SDS and temperature conditions are indicated just as 
examples. Those skilled in the art can select the hybridization conditions with similar stringency to those mentioned 
above by properly combining the above-mentioned or other factors (for example, probe concentration, probe length 
and duration of hybridization reaction) that determines the stringency of hybridization. 

[0044] The amino acid sequences of polypeptides isolated by using the hybridization techniques usually have high 

40 identity to those of the polypeptides of the present invention, which are shown in Table 1. The present invention en- 
compasses a polynucleotide comprising a nucleotide sequence that has a high identity to the nucleotide sequence of 
claim 1 (a). Furthermore, the present invention encompasses a peptide, or polypeptide comprising an amino acid 
sequence that has a high identity to the amino acid sequence encoded by the polynucleotide of claim 1 (b). The term 
"high identity" indicates sequence identity of at least 40% or more; preferably 60% or more; and more preferably 70% 

45 or more. Alternatively, more preferable is identity of 90% or more, or 93% or more, or 95% or more, furthermore, 97% 
or more, or 99% or more. The identity can be determined by using the BLAST search algorithm. 
[0045] As used herein, "percent identity" of amino acid sequences or nucleic acids is determined using the algorithm 
BLAST of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 90:5873-5877, 1 993). Such an algorithm is incorporated into 
the BLASTN and BLASTX programs of Altschul et al. (J. Mol. Biol.215:403-410, 1990). BLAST nucleotide searches 

50 are performed with the BLASTN program, for example, score = 100, wordlength = 12. BLAST protein searches are 
performed with the BLASTX program, for example, score = 50, wordlength = 3. When utilizing BLAST and Gapped 
BLAST programs, the default parameters of the respective programs are used. See http://www.ncbi.nlm.nih.gov. 
[0046] With the gene amplification technique (PCR) (Current Protocols in Molecular Biology, edit, Ausubel et al., 
(1987) John Wiley & Sons, Section 6.1-6.4)) using primers designed based on the nucleotide sequence (Table 1) or a 

55 portion thereof identified in the present Example, it is possible to isolate a polynucleotide fragment highly homologous 
to the polynucleotide sequence or a portion thereof and to obtain functionally equivalent polypeptide to a particular 
polypeptide identified in the present Example based on the isolated polynucleotide fragment. 

[0047] The present invention also provides a polynucleotide containing at least 15 nucleotides complementary to a 
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polynucleotide comprising a nucleotide sequence of SEQ ID NOs shown In Table 1 or the complementary strand thereof . 
Herein, the term "complementary strand" is defined as one strand of a double strand DNA composed of A:T and G:C 
base pair to the other strand. Also, "complementary" is defined as not only those completely matching within a contin- 
uous region of at least 15 nucleotides, but also having a identity of at least 70%, favorably 80% or higher, more favorably 
5 90% or higher, and most favorably 95% or higher within that region. The identity may be determined using the algorithm 
described herein. 

[0048] Such a polynucleotide includes probes and primers used for the detection and amplification of a polynucleotide 
encoding the inventive polypeptide. When used as a primer, the polynucleotide usually comprises 15 to 100 bp, and 
preferably of 15 to 35 bp. When used as a probe, the polynucleotide comprises the whole or a part of the sequence 

10 of a polynucleotide of the invention, and comprises at least 15 bp. When used as primers, such polynucleotides are 
complementary at the 3'-end, and restriction enzyme recognition sequences or tags can be added to the 5'-end. 
[0049] Furthermore, polynucleotides of the present invention include an antisense polynucleotide for suppressing 
the expression of a polypeptide of the invention, which comprises an amino acid sequence of SEQ ID NOs shown in 
Table 1. To exert an antisense effect, an antisense polynucleotide has at least 15 bp or more, for example 50 bp or 

is more, preferably 1 00 bp or more, and more preferably 500 bp or more, and usually has 3000 bp or less, and preferably 
2000 bp or less. Antisense polynucleotides can be used in the gene therapy of diseases caused by abnormalities of 
the polypeptides of the invention (abnormal function or abnormal expression). An antisense polynucleotide can be 
prepared, for example, by the phosphorothioate method ("Physicochemical properties of phosphorothioate oligodeox- 
ynucleotides." Stein (1988) Nucleic Acids Res. 16: 3209-3221) based on the sequence information of polynucleotide 

20 encoding a polypeptide of the invention (for example, the nucleotide sequences of SEQ ID NO: 1 to 1970). 

[0050] The polynucleotides or antisense polynucleotides of the present invention can be used in, for example, gene 
therapy. As target diseases, for example, cancers or various inflammatory diseases may be preferable. These mole- 
cules can be used for gene therapy, for example, by administrating them to patients by the in vivo or ex vivo method 
using virus vectors such as retrovirus vectors, adenovirus vectors, and adeno-related virus vectors, or non-virus vectors 

25 such as liposomes. 

[0051] The present invention also includes a partial peptide of the polypeptides of the invention. The partial peptide 
comprises a polypeptide generated as a result that a signal peptide has been removed from a secretory protein. If the 
polypeptide of the present invention has an activity as a receptor or a ligand, the partial peptide may function as a 
competitive inhibitor of the polypeptide and may bind to the receptor (or ligand). In addition, the present invention 

30 includes an antigen peptide for raising antibodies. For the peptides to be specific for the polypeptide of the invention, 
the peptides comprise at least 7 amino acids, preferably 8 amino acids or more, more preferably 9 amino acids or 
more, and even more preferably 10 amino acids or more. The peptide can be used for preparing antibodies against 
the polypeptide of the invention, or competitive inhibitors of them, and also screening for a receptor that binds to the 
polypeptide of the invention. The partial peptides of the invention can be produced, for example, by genetic engineering 

35 methods, known methods for synthesizing peptides, or digesting the polypeptide of the invention with an appropriate 
peptidase. 

[0052] The present invention also relates to a vector into which a polynucleotide of the invention is inserted. The 
vector of the invention is not limited as long as it contains the inserted polynucleotide stably. For example, if E. coli is 
used as a host, vectors such as pBluescript vector (Stratagene) are preferable as a cloning vector. To produce the 

40 polypeptide of the invention, expression vectors are especially useful. Any expression vector can be used as long as 
it is capable of expressing the polypeptide in vitro, in E. coli, in cultured cells, or in vivo. For example, pBEST vector 
(Promega) is preferable for in vitro expression, pET vector (Invitrogen) for E. coli, pME18S-FL3 vector (GenBank Ac- 
cession No. AB009864) for cultured cells, and pME18S vector (Mol. Cell. Biol. (1988) 8: 466-472) for/n vivo expression. 
To insert the polynucleotide of the invention, ligation utilizing restriction sites can be performed according to the standard 

45 method (Current Protocols in Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11 .4-11.11). 
[0053] Recently, the technique of GATEWAY™ system (Invitrogen), which is an expression vector construction sys- 
tem for polypeptide expression, has been developed (Experimental Medicine, Vol. 1 8, No. 1 9 (December), p271 6-271 7 : 
2000). This system includes two types of site-specific recombinases (BP CLONASE™ and LR CLONASE™) derived 
from lambda phage and uses BP CLONASE™-specific recombination sites for an Entry Vector and LR CLONASE™- 

50 specific recombination sites for a Destination Vector, which may comprise a tag useful for polypeptide purification. With 
this system, an expression vector can be obtained by using homologous recombination. 

[0054] First, a polynucleotide fragment of interest is inserted into the entry vector using the first recombination. Then, 
the secondary recombination is allowed to take place between the entry vector, where the polynucleotide fragment of 
interest has been inserted, and the destination vector. Thus, the expression vector can be prepared rapidly and highly 
55 efficiently. With the above-mentioned typical method using restriction enzyme and ligase reactions, the step of expres- 
sion vector construction and expression of polypeptide of interest takes about 7 to 1 0 days. However, with the GATE- 
WAY™ system, the polypeptide of interest can be expressed and prepared in only 3 to 4 days. Thus, the system 
ensures a high-throughput functional analysis for expressed polypeptides (http://biotech.nikkeibp.co.jp/netlink/lto/gate- 



48 



EP 1 308 459 A2 



way/). 

[0055] The present invention also relates to a transformant carrying the vector of the invention. Any cell can be used 
as a host into which the vector of the invention is inserted, and various kinds of host cells can be used depending on 
the purposes. For strong expression of the polypeptide in eukaryotic cells, COS cells or CHO cells can be used, for 
5 example. 

[0056] Introduction of the vector into host cells can be performed, for example, by calcium phosphate precipitation 
method, el ectrop oration method (Current Protocols in Molecular Biology (1 987) Ausubel et al. edit, John Wiley & Sons, 
Section 9.1-9.9), lipofectamine method (GIBCO-BRL), or microinjection method, etc. 

[0057] Further, a polynucleotide containing at least 15 nucleotides comprising a nucleotide sequence of any one of 

10 the polynucleotides comprising the nucleotide sequences of SEQ ID NOs shown in Table 1 or the complementary 
strand thereof can be used not only as a primer for synthesizing full-length cDNAs but also for testing and diagnosing 
the abnormalities of the polypeptide encoded by the full-length cDNA of the present invention. For example, by utilizing 
polymerase chain reaction (genomic DNA-PCR, or RT-PCR) using the polynucleotide of the invention as a primer, 
polynucleotide encoding the polypeptide of the invention can be amplified. It is also possible to obtain the regulatory 

15 region of expression in the S'-upstream by using PCR or hybridization since the transcription start site within the genomic 
sequence can be easily specified based on the 5'-end sequence of the full-length cDNA. The obtained genomic region 
can be used for detection and/or diagnosis of the abnormality of the sequence by RFLP analysis, SSCP, or sequencing. 
Especially, in the case where expression of the mRNA of the present invention varies according to a specific disease, 
analysis of the amount of expression of the mRNA using the polynucleotide of the present invention as a probe or a 

20 primer enables detection and diagnosis of the disease. 

[0058] The present invention also relates to antibodies that bind to the polypeptide of the invention. There are no 
limitations in the form of the antibodies of the invention. They include polyclonal antibodies, monoclonal antibodies, or 
their portions that can bind to an antigen. They also include antibodies of all classes. Furthermore, special antibodies 
such as humanized antibodies and chimeric antibodies are also included. 

25 [0059] The polyclonal antibody of the invention can be obtained according to the standard method by synthesizing 
an oligopeptide corresponding to the amino acid sequence and immunizing rabbits with the peptide (Current Protocols 
in Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11.12-11.13). The monoclonal antibody of 
the invention can be obtained according to the standard method by purifying the polypeptide expressed in E. colt, 
immunizing mice with the polypeptide, and producing a hybridoma cell by fusing the spleen cells and myeloma cells 

30 (Current Protocols in Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11 .4-11 .11). 

[0060] The antibody binding to the polypeptide of the present invention can be used for purification of the polypeptide 
of the invention, and also for detection and/or diagnosis of the abnormalities of the expression and structure of the 
polypeptide. Specifically, polypeptides can be extracted, for example, from tissues, blood, or cells, and the polypeptide 
of the invention is detected by Western blotting, immunoprecipitation, or ELISA, etc. for the above purpose. 

35 [0061] Furthermore, the antibody binding to the polypeptide of the present invention can be utilized for treating the 
diseases that associates with the polypeptide of the invention. If the antibodies are used for treating patients, human 
antibodies, humanized antibodies, or chimeric antibodies are preferable in terms of their low antigenicity. The human 
antibodies can be prepared by immunizing a mouse whose immune system is replaced with that of human (e.g., see 
"Functional transplant of megabase human immunoglobulin loci recapitulates human antibody response in mice" Men- 

40 dez, M.J. et al. (1 997) Nat. Genet. 15: 146-1 56). The humanized antibodies can be prepared by recombination of the 
hypervariable region of a monoclonal antibody (Methods in Enzymology (1991) 203: 99-121). 

[0062] A cDNA of the present invention encodes, for example, an amino acid sequence of a protein that is predicted 
to have the following function. The use of the amino acid sequences of the polypeptides encoded by the cDNAs of the 
present invention enables predicting that the polypeptides have the following functions. It can be predict, from the 

45 results of homology search of SwissProt, GenBank, UniGene, or nr, that these polypeptides have such functions. 
Specifically, for instance, as shown in Examples, searching for a known gene or polypeptide that is homologous to the 
partial sequence of the full-length cDNA of the invention (1 970 clone) and referring the function of the gene and of the 
polypeptide encoded by the gene make it possible to predict the function of the polypeptide encoded by the cDNA of 
the invention. In this way, each of 1078 clones out of the 1970 full-length cDNA clones of the invention was predicted 

50 to encode a polypeptide that was classified into the following categories. 



Secretory and/or membrane protein (516 clones) 
Glycoprotein-related protein (121 clones) 
Signal transduction -related protein (88 clones) 
55 Transcription-related protein (1 43 clones) 

Disease-related protein (331 clones) 
Enzyme and/or metabolism-related protein (219 clones) 
Cell division- and/or cell proliferation-related protein (44 clones) 
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Cytoskeleton-related protein (80 clones) 

Nuclear protein and/or RNA synthesis-related protein (70 clones) 
Protein synthesis- and/or transport-related protein (20 clones) 
Cellular defense-related protein (10 clones) 
5 Development and/or differentiation-related protein (1 9 clones) 

DNA- and/or RNA-binding protein (168 clones) 
ATP- and/or GTP-binding protein (93 clones) 

[0063] The functions of the polypeptides encoded by the cDNAs of the present invention can be predicted by as- 

10 sessing the presence of signal sequence, transmembrane region, nuclear translocation signal, glycosylation signal, 
phosphorylation site, and zinc finger motif, SH3 domain, etc. in the amino acid sequences. The programs, PSORT 
(Nakai K., and Kanehisa M. (1992) Genomics 14: 897-911), SOSUI (Hirokawa T. et al. (1998) Bio informatics 14: 
378-379) (Mitsui Knowledge Industry), and MEMSAT (Jones D.T., Taylor W.R., and Thornton J.M. (1994) Biochemistry 
33: 3038-3049) can be used to predict the existence of the signal sequence or transmembrane region. Alternatively, 

15 a partial amino acid sequence of the polypeptide is fused with another polypeptide such as GFP, the fusion polypeptide 
is transfected into cultured cells, and the localization is analyzed to predict the function of the original polypeptide. 
[0064] Based on the determined nucleotide sequences of the full-length cDNAs obtained in the present invention, it 
is possible to predict more detailed functions of the polypeptides encoded by the cDNA clones, for example, by search- 
ing the databases such as GenBank, Swiss-Prot, UniGene, and nr for homologies of the cDNAs; or by searching the 

20 amino acid sequences deduced from the full-length cDNAs for signal sequences by using software programs such as 
PSORT, for transmembrane regions by using software programs such as SOSUI or for motifs by using software pro- 
grams such as Pfam (http://www.sanger.ac.uk/Software/Pfam/index.shtml) and PROSITE (http://www.expasy.ch/ 
prosite/). As a matter of course, the functions are often predictable by using partial sequence information (preferably 
300 nucleotides or more) instead of the full-length nucleotide sequences. However, the result of the prediction by using 

25 partial nucleotide sequence does not always agree with the result obtained by using full-length nucleotide sequence, 
and thus, it is needless to say that the prediction of function is preferably performed based on the full-length nucleotide 
sequences. 

[0065] GenBank, Swiss-Prot, UniGene and nr databases were searched for homologies of the full-length nucleotide 
sequences of the 1970 clones (see Example 6). The amino acid sequences deduced from the full-length nucleotide 
30 sequences were searched for functional domains by PSORT, SOSUI and Pfam. Prediction of functions of polypeptides 
encoded by the clones and the categorization thereof were performed based on these results obtained. The catego- 
rization was carried out by the following method. 

[1 ] Firstly, the cDN A clones were classified into the above-mentioned 1 4 functional categories based on the results 
35 of annotation-based categorization (using the keywords in the case of Swiss-Prot hit data; using Definition or 

Reference information in the case of GenBank, UniGene, or nr hit data), and the signal sequence search of the 
deduced ORFs by PSORT and the transmembrane region search by SOSUI. 

[2] Secondly, clones which had been unassignable to the categories by the method of [1] were searched for func- 
tional domains and/or motifs by Pfam. Based on the results, the clones were additionally classified into the above- 
40 mentioned 14 types of categories when they had a functional domain and/or motif assignable to any one of the 

categories. 

[0066] The following 516 clones presumably belong to secretory and/or membrane proteins. 

ADRGL20020290, ADRGL20021910, ADRGL20036380, ADRGL20036840, ADRGL20059610, ADRGL20063770, 
45 ADRGL20066770, ASTR02001 001 0, ASTRO20020240, ASTRO20045840, ASTRO20053430, AST R02 005 5530, 
ASTRO20055570, ASTRO20055930, ASTRO2008B950, ASTRO20091180, BNGH420021 680, BNGH420023870, 
BNGH420046790, BNGH420052350, BNGH420059680, BNGH4200 75940, BNGH420077980, BRACE1 0000510, 
BRACE20051930, BRACE20052530, B R AC E2 0054080, BRACE20066360, BRACE20068710, BRACE200690 
BRACE20069110, BRACE201 94670, BRACE20204670, BRACE2021 6950, BRAMY1 0001 730, BRAMY20003880, 
50 BRAMY20013670, BRAMY20024790, BRAMY20027390, BRAMY20028530, BRAMY2 0035380, BRAMY20044920, 
BRAMY20045210, BRAMY20047560, BRAMY20050940, BRAMY2005391 0, BRAMY2 0055760, BRA MY20 072440, 
BRAMY20083820, BRAMY20089770, BRAMY20091230, BRAMY20094890, BRAMY20096930, BRAMY2011841 0, 
BRAMY201 23400, BRAMY201 25550, BRAMY201 27310, B RAM Y201 27760, B RAM Y20 135720, BRAMY201 37360, 
BRAMY20 139440, BRAMY201 52510, BRAMY201 94680, BRAMY20204270, BRAMY20225320, BRAMY20237190, 
55 BRAMY20245140, BRAMY20251 750, BRAMY20285650, BRAWH20020470, BRAWH20021 91 0, BRAWH20026010, 
BRAWH20030000, BRAWH20039640, BRAWH 20055330, BRAWH20078620, BRAWH20093070, BRAWH201 85270, 
BRCAN 10000760, BRCAN1 0001 680, BRCAN20001 480, BRCAN20004180, BRCAN20005230, BRCOC20000470, 
BRCOC20003600, BRHIP1 0000720, BRHIP1 0001 040, BRHIP20000210, BRSSN20001 970, BRSSN20074640, 
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15 



20 



25 



30 



35 



45 



50 



BRSSN20091190, 

CTONG20045500, 

CTONG20060040, 

CTONG20076810, 

CTONG201 67750, 

DFNES20029660, 

FCBBF20032930, 

FCBBF30042610, 

FCBBF301 05440, 

FCBBF301 69870, 

FCBBF301 97840, 

FCBBF30260480, 

FEBRA20018670, 

FEBRA20040560, 

FEBRA20116650, 

HCHON10001660, 

HHDPC20082790, 

HLUNG20032460, 

HLUNG20050760, 

HLUNG20088750, 

KIDNE20084800, 

KIDNE20143200, 

KIDNE20186170, 

LIVER20026440, 

MESAN20027900, 

MESAN20095220, 

NT2NE20064780, 

NT2NE201 45250, 

NT2RI20021200, 

NT2RP70056690, 

OCBBF201 09780, 

OCBBF201 23200, 

OCBBF201 85630, 

PLACE60006300, 

PLACE60064740, 

PLACE601 38840, 

PROST20035830, 

PROST20084680, 

PROST20110120, 

SKMUS20006790, 

SMINT20006090, 

SMINT20081330, 

SPLEN20017610, 

SPLEN20063890, 

SPLEN20105100, 

SPLEN201 25230, 

SPLEN201 82850, 

SPLEN201 93490, 

TESTI20006160, 

TESTI20043220, 

TESTI20051200, 

TESTI20059080, 

TESTI20066330, 

TESTI20095200, 

TESTI201 13940, 

TESTI201 49880, 

TESTI201 83680, 

TEST120204260, 



CD34C20001750, CTONG2001 7490, CTONG20036990, CTONG20041260, 
CTONG20049480, CTONG20051450, CTONG20055850, CTONG20056150, 
CTONG20063770, CTONG20065680, CTONG20068360, CTONG20069320, 
CTONG20078340, CTONG20079590, CTONG20083980, CTONG20084020, 
CTONG201 68240, CTONG201 79890, CTONG201 83830, CTONG20 184830, 
DFNES20057660, DFNES20072990, DFNES20080880, FCBBF20018680, 
FCBBF20036360, FCBBF20054390, FCBBF30004340, FCBBF30022680, 
FCBBF30062490, FCBBF30075970, FCBBF30078600, FCBBF30091520, 
FCBBF301 18670, FCBBF301 32660, FCBBF301 35890, FCBBF301 45670, 
FCBBF30171230, FCBBF301 72330, FCBBF301 77290, FCBBF301 79740, 



FCBBF30223210, FCBBF30225930, 



FCBBF301 72330 
FCBBF30212210, FCBBF30223110 
FCBBF30266510, FCBBF30287940, FCBBF5000061 0, FCBBF50004950, 
FEBRA20031280, FEBRA20031810, FEBRA20038220, FEBRA20039260, 
FEBRA20046280, FEBRA20080860, FEBRA20084750, FEBRA2008881 0, 
FEBRA20121950, FEBRA20141980, FEBRA201 77800, FEBRA201 82030, 
HCHON20015050, HEART1 0001490, HEART20031680, HHDPC1 0001 140, 
HHDPC20088160, HLUNG20015070, HLUNG200151 80, HLUNG20020850, 
HLUNG20033350, HLUNG20034970, HLUNG200371 60, HLUNG20041 540, 
HLUNG20052300, HLUNG20060670, HLUNG20065990, HLUNG20074330, 
HLUNG20092530, KIDNE2001 6360, KIDNE20083150, KIDNE20084030, 
KIDNE20086490, KIDNE20086660, KIDNE20094670, KIDNE20134130, 
KIDNE201 48080, KIDNE201 60960, KIDNE201 6371 0, KIDNE201 691 80, 
KIDNE201 88630, KIDNE201 89960, LIVER20007750, LIVER2001 051 0, 
LIVER20030650, LIVER20038000, MESAN20007110, MESAN20008150, 
MESAN20058110, MESAN20059570, MESAN20060430, MESAN20067430, 
NT2NE20018740, NT2NE20021 860, NT2NE2003921 0, NT2NE20053230, 
NT2NE20069580, NT2NE20080770, NT2NE20082130, NT2NE20092950, 
NT2NE201 46510, NT2NE20 152620, NT2NE201 67660, NT2NE20181800, 
NT2RI20033920, NT2RP70003110, NT2RP70027790, NT2RP 7003 1070, 
NT2RP70087140, NTONG20034540, NTONG20053630, OCBBF20000740, 
OCBBF20110210, OCBBF201 10730, OCBBF201 12280, OCBBF201 18720, 
OCBBF201 55030, OCBBF201 65900, OCBBF201 6591 0, OCBBF201 70350, 
OCBBF20191950, PANCR1 0000860, PEBLM20001 800, PLACE50001290, 
PLACE60053280, PLACE60055590, PLACE60056910, PLACE60057860, 
PLACE60070500, PLACE60087680, PLACE 60 104630, PLACE601 07010, 
PLACE60154450, PLACE601 84870, PROST1 0001 100, PROST200111 60, 
PROST20045700, PROST20050390, PROST200651 00, PROST20073280, 
PROST20084720, PROST20099090, PROST201 05450, PROST201 06060, 
PROST20114100, PROST201 46590, PROST201 5251 0, PROST201 68600, 
SKMUS20020770, SKMUS20073150, SKMUS20091900, SKNMC20006350, 
SMINT20008110, SMINT20024140, SMINT20028840, SMINT20045470, 
SMINT20086250, SMINT20088440, SMINT20088690, SMINT200921 60, 
SPLEN20017810, SPLEN20024190, SPLEN20024620, SPLEN20054500, 
SPLEN20073880, SPLEN20080070, SPLEN20090880, SPLEN20101950, 
SPLEN201 08000, SPLEN20110180, SPLEN20110860, SPLEN20118050, 
SPLEN201 36700, SPLEN201 38600, SPLEN20139100, SPLEN201 75920, 
SPLEN201 83020, SPLEN201 83950, SPLEN201 90080, SPLEN201 90770, 



SPLEN201 93790, SPLEN20201 830, 
TESTI20029100, TESTI20031310, 



SPLEN20204670, TESOP1 0000350 
TESTI20032770, TESTI20038240, 



TESTI20045390, 
TESTI20051730, 
TESTI20061200, 
TESTI20068530, 
TESTI20095440, 
TESTI20116120, 
TESTI20151800, 
TESTI201 84750, 
TESTI20210030, 



TESTI20046540, 
TESTI20053260, 
TESTI20062120, 
TESTI20070400, 
TESTI20095880, 
TESTI20121040, 
TESTI201 62780, 
TESTI20186110, 
TESTI20214630, 



TESTI20046870, 
TESTI20053780, 
TESTI20063330, 
TESTI20070740, 
TESTI201 00090, 
TESTI20121710, 
TESTI20170170, 
TESTI201 98540, 
TESTI20219110, 



TESTI20047370, 
TESTI20057200, 
TEST(20063410, 
TESTI20073460, 
TESTI201 02390, 
TESTI20131440, 
TESTI201 73050, 
TESTI20199110, 
TESTI20244730, 



CTONG20044870, 
CTONG20059130, 
CTONG20071680, 
CTONG20085210, 
DFNES20018000, 
FCBBF20029280, 
FCBBF30029250, 
FCBBF30095410, 
FCBBF301 64510, 
FCBBF301 95690, 
FCBBF30230610, 
FEBRA20007820, 
FEBRA20040230, 
FEBRA201 15930, 
FEBRA20191720, 
HHDPC20051850, 
HLUNG20029490, 
HLUNG20042730, 
HLUNG20081390, 
KIDNE20084040, 
KIDNE201 42900, 
KIDNE201 82540, 
LIVER20010990, 
MESAN20021220, 
MESAN20084150, 
NT2NE20059210, 
NT2NE201 40130, 
NT2RI20016240, 
NT2RP70031480, 
OCBBF20012520, 
OCBBF201 20010, 
OCBBF201 76650, 
PLACE60004260, 
PLACE60061370, 
PLACE601 13340, 
PROST20014150, 
PROST20082430, 
PROST201 08850, 
PUAEN1 0000870, 
SKNSH20094350, 
SMINT20077960, 
SPLEN20015100, 
SPLEN20058180, 
SPLEN201 04690, 
SPLEN20121790, 
SPLEN201 77400, 
SPLEN201 93230, 
, TESTI1 00001 90, 
TEST120043130, 
TESTI20050400, 
TESTI20057590, 
TESTI20063600, 
TESTI20086840, 
TESTI201 05910, 
TESTI201 42540, 
TESTI201 82760, 
TESTI20202830, 
TESTI20245600, 
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TESTI20245860, TESTI2024641 0, TESTI20251 610, TESTI2025791 0, TESTI20260640, TESTI20261 040, 
TESTI20262150 : TESTI20262940, TESTI2026491 0, TESTI20271 790, TESTI20278280, TESTI20282420, 
TESTI20282900, TESTI20286590, THYMU20007020, THYMU200 12020, THYMU2001 7270, THYMU20020800, 
THYMU20025480 : THYMU20028150, THYMU20030690, THYMU20034790, THYMU20046350, THYMU20046770, 

5 THYMU20050010, THYMU20052830, THYMU20054800, THYMU20055740, THYMU20055760, THYMU20062770, 
THYMU20078240, THYMU20079690, THYMU20083390, THYMU20087270, THYMU20100940, THYMU20115380, 
THYMU201 37050, THYMU201 37570, THYMU20143230, THYMU20150190, THYMU2015321 0, THYMU201 54790, 
THYMU201 63600, THYMU20171580, THYMU20 178440, THYMU201 85470, TRACH20011 01 0, TRACH20011540, 
TRACH20021380, TRACH20073990, TRACH20081 270, TRACH20090060, TRACH201 49720, TRACH201 49740, 

10 TRACH201 59390, TRACH20 163470, TRACH201 65330, TRACH201 67090, TRACH201 73680, TRACH201 90460, 
UMVEN10001380, UTERU20035770, UTERU20040150, UTERU20045200, UTERU20064120, UTERU20086530, 
UTERU20087070, UTERU20087850, UTERU20089300, UTERU20089620, UTERU20095100, UTERU20099040, 
UTERU201 03200, UTERU20125810, UTERU201 27030, UTERU201271 50, UTERU201 39760, UTERU20188840 
[0067] The following 121 clones presumably belong to glycoprotein- related proteins. 

15 ADRGL20020290, ADRGL20036840, ADRGL20059610, ADRGL20066770, ASTRO20055570, BNGH420046790, 
BNGH420077980, BRACE20051930, BRACE20069000, BRACE20204670, BRACE20216950, BRAMY20013670, 
BRAMY20089770, BRAMY20251 21 0, BRAWH20039640, BRCAN1 0000760, BRCAN20005230, BRCOC20003600, 
CD34C20001750, CTONG20017490, CTONG20036990, CTONG20045500, CTONG20059130, CTONG20079590, 
CTONG20085210, CTONG201 84830, DFNES2001 8000, DFNES20080880, FCBBF30004340, FCBBF30029250, 

20 FCBBF30062490, FCBBF30091 520, FCBBF301 6451 0, FCBBF301 71 230, FCBBF30195690, FCBBF3022321 0, 
FEBRA20038220, HCHON2001 5050, HLUNG20015070, HLUNG20032460, HLUNG200371 60, HLUNG20041540, 
KIDNE20142900, KIDNE201 691 80, KIDNE20186170, KIDNE201 89960, MESAN20021 220, MESAN2005811 0, 
NT2NE20064780, NT2NE20140130, NT2NE201 55650, NT2RP70056690, NTONG20053630, OCBBF20000740, 
OCBBF20012520, OCBBF201 10210, OCBBF20120010, OCBBF201 65900, OCBBF201 65910, OCBBF20191950, 

25 PEBLM20001800, PLACE60004260, PLACE60087680, PLACE601 13340, PLACE601 84870, PROST20033240, 
PROST20099090, PROST201 08850, PROST201 46590, SKMUS20073150, SKNMC20006350, SM1NT20028840, 
SMINT20056230, SMINT20083290, SMINT200911 90, SPLEN20024620, SPLEN20063890, SPLEN20080070, 
SPLEN20090880, SPLEN201 1 8050, SPLEN20139100, SPLEN201 83020, SPLEN20201 830, TESTM 00001 90, 
TESTI20031 31 0, TESTI20043990, TESTI20045390, TESTI20051 200, TESTI20057590, TESTI20059080, 

30 TESTI20066330, TESTI20086840, TESTI201 00090, TESTI201 05910, TESTI20 154370, TESTI201 64210, 
TESTI201 82760, TESTI201 84750, TESTI201 991 1 0, TESTI2021 911 0, TESTI20220230, TESTI20245600, 
TESTI20251610, TESTI2025791 0, TESTI20286590, THYMU20024500, THYMU200281 50, THYMU20052830, 
THYMU20062770, THYMU20099060, THYMU201 70080, THYMU201 78440, TRACH20011 01 0, TRACH20011540, 
TRACH20021380, TRACH201 49740, TRACH201 70860, TRACH201 90460, UTERU20086530, UTERU20087070, 

35 UTERU201 27030 

[0068] The following 88 clones presumably belong to signal transduction-related proteins. 

ASTRO20050810, ASTRO20052420, ASTRO20085080, ASTRO20090680, BNGH420008150, BNGH42001 5760, 
BNGH420035290, BNGH420086030, BRAMY20035830, BRAMY20043630, BRAMY201 18490, BRAMY20206340, 
BRAMY20244490, BRAMY20251210, BRAMY20263000, BRAWH20093040, BRAWH20 190550, CTONG20004520, 

40 CTONG20029030, CTONG20030280, CTONG20063930, CTONG20070720, CTONG201 89000, FCBBF300011 00, 
FCBBF30076310, FCBBF301 00080, FCBBF301 43550, FCBBF30153170, FCBBF301 75350, FCBBF30250980, 
FEBRA20090160, FEBRA201 73330, HCHON20000870, HLUNG20011260, HLUNG20084790, KIDNE20089870, 
KIDNE201 60360, LIVER20011 640, MESAN20021130, MESAN20027240, MESAN20065990, NT2NE2001 8890, 
NT2NE20042550, NT2RP70075800, NTONG20043080, NTONG2 0048440, PLACE60071800, PROST20033240, 

45 PROST20052850, PROST20065790, PROST20075280, SKNSH20052400, SKNSH20057920, SMINT20006020, 
SMINT20035050, SPLEN20023540, SPLEN200391 80, SPLEN20048800, SPLEN20049840, SPLEN200541 60, 
SPLEN2008591 0, SPLEN201 91 020, SPLEN201 98390, TESTI20046490, TESTI20049060, TESTI20053070, 
TESTI20066650, TESTI20081 890, TESTI20095770, TESTI201 06820, TESTI201 1 2860, TESTI201 45780, 
TESTI201 50420, TESTI201 68880, TESTI20205250, TESTI202281 20, TESTI20244220, TESTI20244460, 

50 TESTI20251740, TESTI202611 60, TESTI20264530, THYMU2001 3250, THYMU20039320, THYMU201 06990, 
THYMU201 45990, THYMU201 70080, THYMU20176010, TRACH20188350 
[0069] The following 1 43 clones presumably belong to transcription-related proteins. 

ASTRO20038400, ASTRO20075150, BNGH420070370, BNGH420074600, BNGH420087430, BRACE20003310, 
BRACE20061620, BRAMY20001510, BRAMY20040580, BRAMY20076100, BRAMY20111780, BRAWH20040680, 
55 BRAWH20050740, BRAWH20080580, BRAWH20082920, BRAWH20095900, BRSSN20066440, CTONG20020950, 
CTONG20044230, CTONG20053990, CTONG20072930, CTONG20074000, CTONG200 84660, CTONG201 86370, 
CTONG201 86520, DFNES20028170, DFNES20046840, DFNES20073320, FCBBF3000361 0, FCBBF3001 9140, 
FCBBF30021900, FCBBF300931 70, FCBBF3011 4850, FCBBF3012901 0, FCBBF301 36230, FCBBF301 43550, 
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FCBBF30220050, FCBBF30228940, FCBBF30263080, FCBBF30285930, FCBBF50003530, FEBRA20026820, 
FEBRA20027070, FEBRA20046510, FEBRA20057010, FEBRA20063720, FEBRA201 70240, HCHON1 0000150, 
HCHON20002650, HEART2001 931 0, HLUNG20014590, HLUNG2002811 0, HLUNG2006370 KIDNE201 40870, 
LIVER20006260, MESAN20016270, MESAN20038520, NT2NE20038870, NT2NE20053950, NT2NE20060750, 

5 NT2NE20061030, NT2NE20079670, NT2NE20082600, NT2RP70001120, NT2RP70029780, NT2RP7004641 0, 
NT2RP70057500, NT2RP70075300, NT2RP70090870, OCBBF201 16250, OCBBF20120950, OCBBF20121 910, 
OCBBF201 56450. OCBBF201 57970, OCBBF201 66900, OCBBF201 75360, OCBBF201 77540, PEBLM20003260, 
PLACE60052940, PLACE60066970, PLACE601 22970, PLACE6015051 0, PLACE60 177880, PROST200071 70, 
PROST20024250, PROST20035170, PROST201 27450, PROST20151370, PROST201 55370, PUAEN1 0000650, 

10 PUAEN20003120, SMINT20011 950, SMINT20026200, SMINT20030740, SMINT20039050, SMINT20044140, 
SMINT20086720, SPLEN20042200, SPLEN20043680, SPLEN20055600, SPLEN20059270, SPLEN20063250, 
SPLEN20098030, SPLEN201 97930, TESTI1 0001 570, TESTI20057430, TESTI20057840, TESTI2005981 0, 
TESTI20067480, TESTI20068790, TESTI20075240, TESTI20079220, TESTI20088840, TESTI201 04090, 
TESTI201 22070, TESTI201 66670, TESTI201 71 070, TESTI201 73960, TESTI201 84760, TESTI201 94880, 

15 TESTI201 97600, TESTI20228740, TESTI20254030, TESTI20254990, TESTI20266050, TESTI20274960, 
TESTI20282530, THYMU 10004280, THYMU20019260, THYMU20032820, THYMU200711 20, THYMU20077250, 
TKIDN1 0001 920, UTERU20016580, UTERU20026620, UTERU20041630, UTERU20094830, UTERU20099510, 
UTERU20101150, UTERU201 69020, UTERU201 77150, UTERU201 85220, UTERU201 88670 
[0070] The following 331 clones presumably belong to disease-related proteins. 

20 ADRGL20020290, ADRGL20021 91 0, ADRGL20026790, ADRGL20036840, ADRGL2005961 0, ADRGL20066770, 
ASTRO20038400, ASTRO20052420. ASTRO20055570, ASTRO200751 50, ASTRO20088950, BNGH420008150, 
BNGH420086030, BRACE1 0000510, BRACE20003310, BRACE20069000, BRACE20097540, BRACE201 94670, 
BRACE20196180, BRACE20204670, BRACE20216950, BRAMY20003540, BRAMY20005080, BRAMY20035830, 
BRAMY20040580, B RAM Y2 0043630, BRAMY20044920, BRAMY20051820, BRAMY20056620, BARMY20089770, 

25 BRAMY20111780, BRAMY201 5251 0, BRAMY201 90550, BRAMY20221 600, BRAMY20227860, BRAMY2027451 0, 
BRAWH20082920, BRAWH20093040, BRAWH20095900, BRAWH201 90530, BRAWH201 91 980, BRCAN1 0000760, 
BRCAN 10001050, BRCAN20005230, BRSSN20066440, CTONG20004520, CTONG20029030, CTONG20042640, 
CTONG20045500, CTONG20052780, CTONG20053990, CTONG20070780, CTONG2007091 0, CTONG20072930, 
CTONG20083980, CTONG20084660, CTONG201 65750, CTONG201 69040, CTONG2 01 83430, CTONG201 83830, 

30 CTONG201 86290, CTONG201 89000, DFNES200 16470, DFNES20025500, DFNES20046840, DFNES20055400, 
DFNES20080880, FCBBF1 0000230, FCBBF20035490, FCBBF20066340, FCBBF30002270, FCBBF30002280, 
FCBBF30019140, FCBBF30053300, FCBBF30071500, FCBBF30072440, FCBBF3007631 0, FCBBF30080730, 
FCBBF301 00080, FCBBF301 15920, FCBBF301 18670, FCBBF301 2901 0 ; FCBBF301 32050, FCBBF301 36230, 
FCBBF30153170, FCBBF301 6451 0, FCBBF301 66220, FCBBF301 71230, FCBBF301 75350, FCBBF301 94550, 

35 FCBBF30220050, FCBBF3022321 0, FCBBF30259050, FCBBF30263080, FCBBF30275590, FCBBF50001 650, 
FEBRA20027070, FEBRA20045380, FEBRA20046200, FEBRA2004651 0, FEBRA20057010, FEBRA20063720, 
FEBRA20078800, FEBRA20087550, FEBRA20088810, FEBRA20090160, FEBRA20092760, FEBRA20151 750, 
FEBRA201 70240, FEBRA201 73330, FEBRA20191720, HCHON1 00001 50, HCHON20015050, HEART20009590, 
HEART20022200, HEART2 00631 00, HHDPC20081230, HLUNG20008460, HLUNG2001 4590, HLUNG20032460, 

40 HLUNG20063700, HLUNG20065990, HLUNG20069350, HLUNG20081530, HLUNG20082350, HLUNG20083330, 
HLUNG20085210, KIDNE20081170, KIDNE20084040, KIDNE20088240, KIDNE20089870, KIDNE201 33460, 
KIDNE20134890, KIDNE20141 700, KIDNE20142900, KIDNE201 50730, KIDNE201 52440, KIDNE201 60360, 
KIDNE201 65390, KIDNE201 691 80, KIDNE201 73430, KIDNE201 89960, LIVER20026440, MESAN20006200, 
MESAN20021130, MESAN20033220, MESAN20056890, MESAN20057240, MESAN20065990, MESAN20067430, 

45 MESAN20069530, NESOP20004520, NT2NE20018890, NT2NE20026200, NT2NE20037050, NT2NE20053950, 
NT2NE20061030, NT2NE20111190, NT2NE201 17580, NT2NE201 19980, NT2NE20140130, NT2NE20141040, 
NT2RI20093010, NT2RP700031 1 0, NT2RP70046410, NT2RP70075300, NTONG200321 00, NTONG2 0034 540, 
OCBBF20000740, OCBBF2001 2520, OCBBF20111600, OCBBF2012001 0, OCBBF201 56450, OCBBF201 57970, 
OCBBF20191950, PEBLM20001800, PEBLM20003260, PLACE60004260, PLACE6001262 PLACE60054230, 

50 PLACE60054870, PLACE60062660, PLACE60087680, PLACE601 84870, PROST2001 521 0, PROST20024250, 
PROST20036350, PROST20050390, PROST20058860, PROST200 63430, PROST20065790, PROST20084720, 
PROST20099090, PROST20 120070, PROST20 127450, PROST20 146590, PROST20152510, PROST201 68600, 
PUAEN 10000650, PUAEN20003120, SKMUS20008730, SKMUS20017400, SKMUS20040440, SKMUS20073590, 
SKMUS20079150, SKNSH2000971 0, SMINT20002320, SMINT20007470, SMINT200081 1 0, SMINT20011950, 

55 SMINT20016150, SM I NT2 0026200, SMINT20030740, SMINT20049920, SMINT20077960, SMINT20083290, 
SMINT20086250, SMINT20089600, SMINT200911 90, SPLEN20023540, SPLEN200241 90, SPLEN20042200, 
SPLEN20043680, SPLEN20055600, SPLEN20057830, SPLEN20059270, SPLEN20063890, SPLEN20073500, 
SPLEN20080070, SPLEN2008591 0, SPLEN20090880, SPLEN20098030, SPLEN201 18050, SPLEN201 36730, 
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SPLEN201 38600, SPLEN20139100, SPLEN201 39360, SPLEN201 80980, SPLEN201 87490, SPLEN201 93790, 
SPLEN20201 830, TEST 11 00001 90, TESTI20031 31 0, TESTI20035790, TESTI20041 630, TESTI20049060, 
TESTI20050720, TESTI20051 200, TESTI 20057430, TESTI20057590, TESTI20059080, TESTI200621 20 ; 
TESTI20067480, TESTI20071 630, TESTI20099350, TESTI201 051 30, TESTI201 0591 0, TESTI201 08060, 

5 TESTI201 25920, TESTI201 30530, TESTI20131440, TESTI201 34680, TESTI201 42540, TESTI20143180 : 
TESTI201 50420, TESTI201 54370, TESTI201 6421 0, TESTI201 66670, TESTI201 68880, TESTI201 71 070, 
TESTI201 82760, TESTI201 84750, TESTI201 93080, TESTI201 94880, TESTI201 96970, TEST1201 97600, 
TESTI20201 760, TESTI202071 70, TESTI2021 91 1 0, TESTI20228740, TESTI20244430, TESTI20246480, 
TESTI20251 740, TESTI20252690, TESTI20254030, TESTI2025791 0, TESTI20258720, TESTI20266050, 

10 TESTI20271790, TESTI20274960, TESTI20282530, TESTI20286590, THYMU 10004280, THYMU20006020, 
THYMU20013250, THYMU20019260, THYMU20023560, THYMU20028150, THYMU20032820, THYMU20034400, 
THYMU20055460, THYMU20063650, THYMU20070250, THYMU20071120, THYMU20081110, THYMU20090230, 
THYMU20095920, THYMU20098350, THYMU20099060, THYMU201 20730, THYMU20121040, THYMU201 70080, 
THYMU201 85650, THYMU20191970, TKIDN1 0000620, TKIDN10001 920, TRACH20011540, TRACH20091070, 

15 TRACH201 43710, TRACH201 70860, UTERU1 0001 060, UTERU20026620, UTERU20041630, UTERU20086530, 
UTERU20087070, UTERU20087850, UTERU20099510, UTERU20101150, UTERU201 04310, UTERU201 27030, 
UTERU201 85220 

[0071 ] The following 2 1 9 clones presumably belong to the category of enzymes and/or metabolism-related proteins. 
ADRGL20059610, ASTRO20026320, ASTRO20050810, ASTRO20088950, BNGH420008150, BNGH420035290, 

20 BNGH420074600, BRACE20050870, BRACE20097540, BRACE20200770, BRACE20204670, BRACE2021 5410, 
BRAMY20003540, BRAMY20005080, BRAMY20027990, BRAMY20028620, BRAMY20044920, BRAMY20055760 : 
BRAMY20056620, BRAMY20072870, BRAMY20093490, BRAMY20096930, BRAMY2011 8490, BR AM Y20 125360, 
BRAMY201 43870, BR AMY20 15251 0, BRAMY20231150, BRAMY20244490, BRAMY20251210, BRAWH20021 91 0, 
BRAWH20082920, BRAWH20093040, BRAWH20094900, BRAWH201 831 70, BRAWH201 88750, BRAWH201 90550, 

25 BRAWH20191980, BRCAN20005230, BRCOC20003600, CTONG20051100, CTONG2007091 0, CTONG2007681 0, 
CTONG20079590, CTONG20080140, CTONG20085210, CTONG201 86290, DFNES20063460, DFNES20080880, 
FCBBF20023490, FCBBF20066340, FCBBF30 004340, FCBBF30019140, FCBBF30022680, FCBBF30029250, 
FCBBF30072440, FCBBF3007631 0, FCBBF30085560, FCBBF30091 520, FCBBF301 07290, FCBBF301 25880, 
FCBBF301 32050, FCBBF301 43550, FCBBF30153170, FCBBF301 66220, FCBBF301 71230, FCBBF30 175350, 

30 FCBBF30236670, FCBBF30260480, FEBRA20038220, FEBRA20040560, FEBRA20078800, FEBRA20090160, 
FEBRA201 72230, FEBRA201 73330, HCHON20000870, HCHON20002710, HEART1 0001 490, HEART20022200, 
HEART20047640, HEART20082570, HLUNG20011260, HLUNG20032460, HLUNG20041 540, HLUNG20042730, 
HLUNG20054790, KIDNE20080690, KIDNE20083620, KIDNE20084040, KIDNE201471 70, KIDNE201 52440, 
KIDNE20173150, KIDNE201 861 70, KIDNE20189960, LIVER20011 640, LIVER20026440, LIVER20055270, 

35 MESAN20021130, MESAN20033220, MESAN20038520, MESAN20057240, MESAN20058110, MESAN20065990, 
MESAN20095800, NT2NE20026200, NT2NE20042550, NT2NE2011 7580, NT2NE201 27900, NT2RI20093010, 
NT2RP70064570, NTONG20 034540, NTONG20043080, NTONG20053630, NTONG20053730, NTONG20058010, 
OCBBF201 20010, OCBBF201 67290, OCBBF20191950, PANCR1 0000860, PLACE60052940, PLACE60064180, 
PLACE60073090, PLACE60095600, PLACE601 84410, PLACE601 88630, PROST20007600, PROST20033240, 

40 PROST20036350, PROST20039300, PROST20050390, PROST2005131 0, PROST20052850, PROST20065790, 
PROST20075280, PROST200 84720, PROST20099090, PROST201 08850, PROST201 52510, PUAEN20001520, 
PUAEN20002470, SKNMC20006350, SKNSH20057920, SMINT2000811 0, SMINT20049920, SMINT20094680, 
SPLEN20023540, SPLEN20024930, SPLEN20043680, SPLEN20048800, SPLEN20054500, SPLEN20057900, 
SPLEN20071820, SPLEN20080070, SPLEN2008591 0, SPLEN201 08000, SPLEN201 36730, SPLEN201 80980, 

45 TESTI20012080, TESTI20030200, TESTI20031310, TESTI20038240, TESTI20050720, TESTI20051200, 
TESTI20059080, TESTI20062120, TESTI20066330, TESTI20076570, TESTI201 03690, TESTI201 05130, 
TESTI201 06820, TESTI201 08060, TESTI201 1 2860, TESTI201 21 040, TEST1201 30530, TESTI201 31 440, 
TESTI201 68880, TESTI201 701 70, TESTI201 96690, TESTI201 96970, TESTI201 991 1 0, TESTI20205250, 
TESTI2021 2970, TESTI20222030, TESTI20226520, TESTI20227380, TESTI 20244460, TESTI20244730, 

50 TESTI20250630, TESTI20260640, TESTI20262940, TESTI 20264530, TESTI20285230, THYMU20006020, 
THYMU20013250, THYMU20034400, THYMU20039320, THYMU20055460, THYMU20055760, THYMU20063650, 
THYMU20066660, THYMU20070250, THYMU20087270, THYMU20096580, THYMU201 00940, THYMU20110720, 
THYMU201 20240, THYMU201 20730, THYMU201 70230, TRACH2001 101 0, TRACH20021380, TRACH20091070, 
TRACH201 13020, TRACH2014371 0, TRACH201 641 00, TRACH201 90460, UTERU20087070, UTERU20089620, 

55 UTERU201 04310, UTERU20 185220, UTERU201 88670 

[0072] The following 44 clones presumably belong to the category of cell division- and/or cell proliferation-related 
proteins. 

ASTRO20090680, BRACE20079370, BRAMY20234820, BRCAN1 0001 050, BRCAN2000541 0, CTONG20032930, 
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FC8BF20070950, FCBBF30002270, FCBBF30053300, FCBBF301 05860, FCBBF301 75350, FCBBF302 15240, 
FCBBF30275590, FEBRA20045380 : HLUNG20068120, KIDNE20134890, KIDNE201 50730, MESAN20021470, 
NT2NE20077250, NT2NE201 53620, NT2RP70030840, NTONG2005391 0, OCBBF20111370, OCBBF201 74580, 
PROST20063430, SKNMC10001230, SMINT20028800, SPLEN20023540, SPLEN20057830, SPLEN201 39360, 
5 TESTI20031410, TESTI20057840, TESTI20065650, TESTI20066650, TESTI201 07320, TESTI201 08060, 
TESTI201 1 4480, TESTI201 34680, TESTI201 431 8 TESTI20 1 50920, TESTI20201 760, TESTI20278280, 
TESTI20284260, THYMU20097920 

[0073] The following 80 clones presumably belong to the category of cytoskeleton-related proteins. 
ADRGL20062330, ASTR020053430, BGGI120000670, BRACE20079370, BRAMY20038980, BRAMY20083330, 

w BRAMY20094890, CTONG20004110, CTONG20032930, CTONG20077760, CTONG20083980, CTONG201 69040, 
CTONG201 83430, DFNES2001 8000, FCBBF301 05860, FCBBF301 30410, FCBBF30 194550, FCBBF30201630, 
FCBBF30271990, FEBRA20005040, FEBRA20046200, FEBRA20099860, HCHON20015050, HLUNG20081530, 
KIDNE20081170, NT2RP70001 730, NT2RP70003110, NTONG2 0032 100, OCBBF201 66890, OCBBF201 74890, 
PLACE60054870, PLACE60055590, PLACE60071800, PLACE601 18810, PROST20015210, PROST20097840, 

15 PROST20120070, PROST201 46590, SKMUS20007260, SKMUS20008730, SKMUS20017400, SKMUS20073590, 
SMINT20062050, SMINT20074330, SMINT20077960, SPLEN200391 80, SPLEN20049840, SPLEN20076470, 
SPLEN201 82990, SPLEN201 87490, SPLEN201 95710, TEST1 10000 190, TESTI20041630, TESTI20057880, 
TESTI20058920, TESTI20060080, TESTI20064530, TESTI20064650, TESTI20065650, TESTI20067440, 
TESTI20071 1 30, TESTI20099350, TESTI201 1 2540, TESTI201 25280, TESTI201 3601 0, TESTI201 5331 0, 

20 TESTI201 75370, TESTI20222460, TESTI20244430, TESTI20254030, TESTI20258720, THYMU20024500, 
THYMU20062610, THYMU20098350 : TRACH20043360, TRACH2009851 0, TRACH201 49500, UTERU20089390, 
UTERU201 22520, UTERU201 68960 

[0074] The following 70 clones presumably belong to the category of nuclear proteins and/or RNA synthesis- related 
proteins. 

25 ASTRO20026320, BRACE20050870, BRACE20200770, BRAMY201 34050, BRAWH2006301 0, BRAWH20093040, 
BRAWH20 174330, BRAWH201 76850, CTONG20042640, FCBBF20023490, FCBBF20035490, FCBBF20070950, 
FCBBF30002270, FCBBF30048420, FCBBF30080730, FCBBF3011 5920, FCBBF30236670, FEBRA20035240, 
FEBRA20092760, FEBRA201 73330, HHDPC20081230, HLUNG20011460, HLUNG20068120, KIDNE20089870, 
KIDNE201 50730, MESAN20056890, MESAN20057240, NT2NE20037050, NT2NE20 167660, NT2RP70031070, 

30 NTONG20053730, PLACE600641 80, PLACE60095600, PROST20016760, PROST2005131 0, PROST20058860, 
PROST201 52510, PUAEN20002470, SKMUS20079150, SKNSH20030640, SPLEN20023850, SPLEN20057830, 
SPLEN201 39360, SPLEN201 90430, TESTI20006830, TESTI20030200, TESTI20031 41 0, TESTI20035790, 
TESTI200621 20, TESTI20065650, TESTI20081 890, TESTI201 50920, TESTI201 5331 0, TESTI20201 760, 
TESTI2021 2970, TESTI20227380, TESTI20251 740, TESTI20256560, TESTI20260640, TESTI202701 30, 

35 TESTI20284260, TESTI20285230, THYMU20021 090, THYMU20049060, THYMU20066660, THYMU20081110, 
THYMU20090230, THYMU201 20240, UTERU1 0001 060, UTERU201 04310 

[0075] The following 20 clones presumably belong to the category of protein synthesis- and/or protein transport- 
related proteins. 

BRAMY20038980, BRAMY2027451 0, CTONG200081 90, CTONG2003361 0, FCBBF2001 8680, FEBRA20090220, 
40 KIDNE20141700, NT2NE201 67660, NTONG20055200, PLACE600 12620, PROST20036350, PROST20062820, 
SKMUS20040440, SMINT20000070, SPLEN201 80980, TESTI20055680, TESTI20067440, TESTI201 07240, 
THYMU20096580, THYMU20121 040 

[0076] The following 1 0 clones presumably belong to the category of cellular defense -related proteins. 
ASTRO20089600, BRAMY201 17670, FEBRA20087550, HLUNG20081390, MESAN20057240, NTONG20031 580, 
45 PROST20007600, SPLEN20023850, SPLEN20043680, TESTI20261680 

[0077] The following 1 9 clones presumably belong to the category of development and/or differentiation -related pro- 
teins. 

BRACE20061620, BRACE20200770, BRAMY20013670, CTONG20017490, CTONG20020950, HCHON1 0000150, 
MESAN20021470, OCBBF201 65910, PROST201 55370, PUAEN20002470, TESTI20079220, TESTI20079980, 
50 TESTI201 66670, TESTI201 84760, TESTI20252690, TRACH20040390, UTERU20089620, UTERU20094830, 
UTERU201 69020 

[0078] The following 1 68 clones presumably belong to the category of DNA- and/or RNA-binding proteins. 
ASTRO20038400, BGG1 12001 0750, BNGH420070370, BRACE20003310, BRACE20061620, BRAMY20001510, 
BRAMY20040580, BRAMY20076100, BRAMY201 11 780, BRAMY2027451 0, BRAWH20040680, BRAWH20050740, 
55 BRAWH20063010, BRAWH20080580, BRAWH20095900, BRAWH201 74330, BRSSN20066440, CTONG20020950, 
CTONG20044230, CTONG20053990, CTONG20072930, CTONG20074000, CTONG201 65750, CTONG201 86370, 
CTONG201 86520, DFNES20046840, DFNES20073320, FCBBF20035430, FCBBF20070950, FCBBF30002270, 
FCBBF30003610, FCBBF30019140, FCBBF30021 900, FCBBF30048420, FCBBF30080730, FCBBF300931 70, 
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FCBBF301 14850, FCBBF3012901 0, FCBBF301 36230, FCBBF30220050, FCBBF30228940, FCBBF30236670, 
FCBBF30263080, FCBBF30285930, FCBBF50003530, FEBRA20026820, FEBRA20027070, FEBRA20035240 : 
FEBRA20046510, FEBRA2005701 0, FEBRA20063720, FEBRA20087550, FEBRA20092760, FE BR A201 70240, 
FEBRA201 77800, HCHON20002650, HEART20019310, HEART200631 00, HHDPC20081230, HLUNG20011460, 

5 HLUNG20014590, HLUNG2002811 0, HLUNG20063700, HLUNG20068120 i KIDNE20140870, LIVER20006260, 
MESAN20016270, MESAN20056890, MESAN20057240, NT2NE20038870, NT2NE20053950, NT2NE20060750, 
NT2NE20079670, NT2NE20082600, NT2NE20087270, NT2RP70029780, NT2RP70046410, NT2RP70057500, 
NT2RP70075300, NT2RP70090870, OCBBF2011 6250, OCBBF201 20950, OCBBF20121910, OCBBF20 156450, 
OCBBF201 57970, OCBBF201 66900, OCBBF201 75360, OCBBF201 77540, PEBLM 10001 470, PEBLM20003260, 

io PLACE60066970, PLACE601 22970, PLACE601 77880, PROST200071 70, PROST20024250, PROST20035170, 
PROST20051310, PROST20058860, PROST20151 370, PROST20155370, PUAEN20003120, SMINT20011950, 
SMINT20030740, SMINT20039050, SMINT20044140, SMINT20086720, SPLEN20042200, SPLEN20043680, 
SPLEN20055600, SPLEN20059270, SPLEN20063250, SPLEN201 39360, SPLEN201 90430, TESTI1 0001 570, 
TESTI20006830, TESTI20030200, TESTI20031 41 0, TESTI20035790, TESTI20057430, TESTI2005981 0, 

15 TESTI20062120, TESTI20067480, TESTI20068790, TESTI20075240, TESTI20079220, TESTI20088840, 
TESTI201 04090, TESTI201 34970, TESTI201 66670, TESTI201 71 070, TESTI201 73960, TESTI201 84760, 
TESTI201 97600, TESTI20201 760, TESTI2021 2970, TESTI20227380, TESTI20228740, TESTI20246480, 
TESTI20254030, TESTI20254990, TESTI20266050, TESTI20268240, TESTI202701 30, TESTI20274960, 
TESTI20282530, TESTI20284260, TESTI20285230, THYMU 10004280, THYMU20019260, THYMU20023560, 

20 THYMU20032820, THYMU20049060, THYMU20066660, THYMU20071120, THYMU20077250, THYMU20081110, 
THYMU20090230, TK1DN1 0001 920, TRACH201 08240, UTERU10001 060 ; UTERU20026620, UTERU20041 630, 
UTERU20094830, UTERU20099510, UTERU20101150, UTERU201 69020, UTERU20177150, UTERU201 88670 
[0079] The following 93 clones presumably belong to the category of ATP- and/or GTP-binding proteins. 
ASTRO20026320, BNGH420035290, BRACE20050870, BRACE20079370, BRACE20200770, BRAMY20055760, 

25 BRAMY20118490, B R AM Y2 0244490, BRAMY20251 21 0, BRAWH20093040, BRAWH20 190550, BRCAN1 0001 050, 
BRCOC20003600, CTONG200081 90, CTONG20030280, CTONG20032930, CTONG201 76040, CTONG201 84830, 
FCBBF20023490, FCBBF30019140, FCBBF3007631 0, FCBBF301 05860, FCBBF301 75350, FCBBF30201 630, 
FCBBF30236670, FEBRA20005040, FEBRA200901 60, F E BR A20 173330, HCHON20000870, HLUNG20011260, 
HLUNG20052300, KIDNE20081170, KIDNE201 34890, LIVER20030650, LIVER20055270, MESAN20065990, 

30 NT2NE20042550, NTONG20043080, NTONG20055200, OCBBF201 82060, PLACE60054870, PLACE60064180, 
PLACE60095600, PLACE 60 140640, PROST20015210, PROST20033240, PROST20036350, PROST2005131 0, 
PROST20052850, PROST20062820, PROST20075280, PROST201 20070, PUAEN20002470, SKNSH20052400, 
SKNSH20057920, SMINT20008110, SPLEN20023850, SPLEN20043680, SPLEN20049840, SPLEN201 36730, 
SPLEN201 80980, SPLEN201 93790, TESTI20055680, TESTI20058920, TESTI20060080, TESTI20064650, 

35 TESTI20071130, TESTI20099350, TESTI201 06820, TESTI2011 2860, TESTI201 34680, TESTI201 3601 0, 
TESTI201 431 80, TESTI201 75370, TESTI2021 2970, TESTI20222460, TESTI20227380, TESTI20244220, 
TESTI20244460, TESTI20264530, THYMU20013250, THYMU20039320, THYMU20062610, THYMU20066660, 
THYMU20087270, THYMU20096580, THYMU201 00940, THYMU201 76010, TRACH20043360, TRACH20098510, 
TRACH201 13020, UTERU201 85220, UTERU201 88670 

40 [0080] Among the clones other than the ones shown above, BNGH42003641 0, FCBBF30257370 are clones which 
were predicted to highly possibly belong to the category of secretory protein and/or membrane protein based on the 
result of domain search by Pfam. 
SMINT20044730, TESTI201 40970 

[0081] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
45 glycoprotein- related protein based on the result of domain search by Pfam. 

BRACE20055560, CTONG20046690, DFNES20043710, FCBBF30005500, MESAN20030350, MESAN20030370, 
PLACE60074820, TESTI20058350, TESTI20106170, TRACH20131230, UTERU20000950 

[0082] The 11 clones shown above are clones which were predicted to highly possibly belong to the category of 
signal transduction-related protein based on the result of domain search by Pfam. 

50 ASTRO20010290, BRACE20099070, CTONG20007660, DFNES20076340, DFNES20094820, FCBBF301 25460, 
FCBBF301 42290, FCBBF301 69280, FEBRA20031000, NT2NE20026510, NT2RP70031340, P LAC E5000 1390, 
SPLEN201 35030, TESTI20046890, TESTI20060350, TESTI20 166290, TESTI20259110, THYMU2 01 84550 
[0083] The 18 clones shown above are clones which were predicted to highly possibly belong to the category of 
transcription-related protein based on the result of domain search by Pfam. 

55 ADRGL20047770, ADRGL20079060, BRACE200 14450, BRACE20051 600, BRAWH201 85260, CTONG20033750, 
CTONG20070090, CTONG201 90290, FCBBF20020440, FCBBF30005360.. FCBBF301 73960, FEBRA20031000, 
KIDNE20087880, LIVER20013890, ME SAN 20030350, MESAN20030370, OCBBF20113110, PLACE60074820, 
PLACE60093380, PROST20028970, PROST20102190, SALGL1 0001 070, SPLEN20006950, SPLEN20011350, 
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SPLEN20050090, TESTI20060830, TESTI200661 50, TESTI201 20900, TESTI201 3231 0, TESTI201 48380, 
TESTI201 62980, TESTI201 66290, TESTI20205100, THYMU20112590 : TRACH200298BO 

[0084] The 35 clones shown above are clones which were predicted to highly possibly belong to the category of 
enzyme and/or metabolism-related protein based on the result of domain search by Pfam. 
5 PLACE60054820, TESTI201 97030 

[0085] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
cell division and/or cell proliferation-related protein based on the result of domain search by Ram. 
ASTRO20006530, OCBBF2001 6390, TRACH20058000 

[0086] The three clones shown above are clones which were predicted to highly possibly belong to the category of 
10 cytoskeleton-related protein based on the result of domain search by Pfam. 
BRACE20065470, PLACE60054820 

[0087] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
nuclear protein and/or RNA synthesis-related protein based on the result of domain search by Pfam. 
ASTRO20010290, BRACE20099070, BRAWH20014590, CTONG20007660, DFNES20076340, DFNES20094820, 
15 FCBBF301 25460, FCBBF301 42290, FCBBF301 69280, FEBRA20031000, MESAN20034440, NT2NE20026510, 
NT2RP70031340, PLACE50001390, SPLEN201 35030, TESTI20046890, TESTI20060350, TESTI201 66290, 
TESTI2025911 0, THYMU201 04480, THYMU201 84550 

[0088] The 21 clones shown above are clones which were predicted to highly possibly belong to the category of 
DNA- and/or RNA-binding protein based on the result of domain search by Pfam. 
20 KIDNE201 33880, MESAN20030350, MESAN20030370, TESTI20059480 

[0089] The four clones shown above are clones which were predicted to highly possibly belong to the category of 
ATP- and/or GTP-binding proteins based on the result of domain search by Pfam. 

[0090] The 205 clones shown below are clones which were unassignable to any of the above-mentioned categories, 
but have been predicted to have some functions based on homology search using theirf ull-length nucleotide sequences 
25 and motif search in their estimated ORFs. Clone Name, Definition in the result of homology search or Motif Name in 
the motif search, demarcated by a double slash mark (//), are shown below. 
ADRGL20022600//DIAPHANOUS PROTEIN HOMOLOG 1 (P140MDIA). 
ADRGL20023920//ABC1 PROTEIN HOMOLOG PRECURSOR. 

ASTR020001910//Rattus norvegicus m RNA for annexin V-binding protein (ABP-10), partial cds. 
30 ASTR020009140//PUTATIVE COMPETENCE-DAMAGE PROTEIN. 
ASTR020046280//PSU1 PROTEIN. 

ASTR020058960//DNA damage inducible protein homolog - fission yeast (Schizosaccharomyces pombe) 
BNGH420024870//C2 domain// C2 domain// C2 domain 
BRACE20007330//RING CANAL PROTEIN (KELCH PROTEIN). 
35 BRACE20052430//Homo sapiens AMSH mRNA, complete cds. 

BRACE20054600//Xenopus taevis mRNA for Kielin, complete cds. 
BRACE20059810//TSC-22/dip/bun family 

BRACE20063540//MEROZOITE SURFACE PROTEIN CMZ-8 (FRAGMENT). 
BRACE20079200//Xenopus laevis mRNA for Kielin, complete cds. 
40 BRAMY20016780//Proprotein convertase P-domain 
BRAMY20023640//UBX domain 

BRAMY20045420//Domain found in Dishevelled, Egl-10, and Pleckstrin 
BRAMY20056840//UBE-1C2 

BRAMY20063750//Homo sapiens HRIHFB2007 mRNA, partial cds. 
45 BRAMY20102900//Homo sapiens RU1 (RU1) mRNA, complete cds. 
BRAMY20158550//CALMODULIN. 

BRAMY20223010//Mus musculus leucine-rich glioma-inactivated 1 protein precursor, (Lgi1) mRNA, complete cds. 
BRAMY20238630//TETRATRICOPEPTIDE REPEAT PROTEIN 4. 
BRAMY20245760//Araneus diadematus fibroin-4 mRNA, partial cds. 
50 BRAWH20047790//HMG (high mobility group) box 

BRSSN2000561 0//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete 
cds. 

BRSSN20005660//Bacterial type II secretion system protein 

BRSSN20093890//Homo sapiens mRNA for Kelch motif containing protein, complete cds. 
55 CTONG200411 50//Streptomyces ansochromogenes strain 7100 SanE (sanE) gene, complete cds. 
CTONG20066110//Homo sapiens DEME-6 mRNA, partial cds. 
CTONG20069420//Ribosomal protein S14p/S29e 
CTONG20071 040//BETA CRYSTALLIN B2 (BP). 
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CTONG20074170//DENN (AEX-3) domain 
CTONG20083430//Nuclear transition protein 2 

CTONG20170940//MYOTROPHIN (V-1 PROTEIN) (GRANULE CELL DIFFERENTIATION PROTEIN). 
CTONG201 74290//TRICHOHYALIN. 
5 CTONG201 74580//Homo sapiens mRNA for vascular Rab-GAP/TBC-containing protein complete cds. 
CTONG20180690//Collagen triple helix repeat (20 copies) 
CTONG201 86550//cca3 protein - rat 
CTONG20188080//TPR Domain 

FCBBF10004760//Homo sapiens GAP-like protein (N61) mRNA, complete cds. 
io FCBBF20033360//RING CANAL PROTEIN (KELCH PROTEIN). 
FCBBF20041380//SAM domain (Sterile alpha motif) 
FCBBF20043730//UBA domain 

FCBBF20056580//Mus musculus NSD1 protein mRNA, complete cds. 
FCBBF20059660//TPR Domain 
15 FCBBF30019180//SERINE/THREONINE PROTEIN PHOSPHATASE 2A, 65 KDA REGULATORY 

SUBUNIT A, ALPHA ISOFORM (PP2A, SUBUNIT A, PR65-ALPHA ISOFORM) (PP2A, SUBUNIT A, R1 -ALPHA ISO- 
FORM). 

FCBBF30026580//Homo sapiens retinoblastoma-associated protein RAP140 mRNA, complete cds. 
FCBBF30035570//C2 domain 
20 FCBBF30079770//D-isomer specific 2-hydroxyacid dehydrogenases 

FCBBF30100120//Mus musculus semaphorin cytoplasmic domain -associated protein 3A (Semcap3) mRNA, complete 
cds. 

FCBBF30100410//Mus musculus testis-specific Y-encoded-like protein (Tspyh) mRNA, complete cds. 
FCBBF30118890//Drosophila melanogaster La related protein (larp) mRNA, partial cds. 
25 FCBBF301 38000//trg protein - rat 

FCBBF30157270//Rattus norvegicus PAP IN mRNA, complete cds. 

FCBBF30161 780//gag gene protein p24 (core nucleocapsid protein)// Zinc knuckle 

FCBBF301 98670//dof protein - fruit fly (Drosophila melanogaster) 

FCBBF30222910//Mus musculus Rap2 interacting protein 8 (RPIP8) mRNA, complete cds. 
30 FCBBF30255680//Rattus norvegicus brain specific cortactin-binding protein CBP90 mRNA, partial cds. 
FCBBF30260210//Drosophila melanogaster KiSMET-L long isoform (kis) mRNA, complete cds. 
FCBBF30282020//cca3 protein - rat 

FCBBF4000061 0//late gestation lung 2 protein [Rattus norvegicus]. 
FEBRA20029620//Leucine Rich Repeat// Leucine Rich Repeat// Leucine Rich Repeat 
35 FEBRA20031150//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds. 
FEBRA20038330//Corticotropin-releasing factor family 
FEBRA20038970//Homo sapiens mRNA for stabilin-1 (stabl gene). 
FEBRA2008861 0//CELLULAR RETINALDEHYDE-BINDING PROTEIN (CRALBP). 
FEBRA201 50420//HYPOTHET1CAL 131.5 KDA PROTEIN C02F12.7 IN CHROMOSOME X. 
FEBRA201 75330//D-isomer specific 2-hydroxyacid dehydrogenases 
HEART10001420//Mus musculus skm-BOP1 (Bop) mRNA, complete cds. 
HLUNG20024050//Rubredoxin 

HLUNG20030420//Mus musculus mRNA for MAIL, complete cds. 
HLUNG20030490//Ambystoma tigrinum RPE65 protein mRNA, complete cds. 
45 HLUNG20033060//Homo sapiens GAP-like protein (N61) mRNA, complete cds. 

HLUNG20041590//ubiquitous tetratricopeptide containing protein RoXaN [Homo sapiens]. 
HLUNG20045340//MOB2 PROTEIN (MPS1 BINDER 2). 
HLUNG20051330//FHIPEP family 

HLUNG20070410//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
50 HLUNG20072100//Gallus gallus Dach2 protein (Dach2) mRNA, complete cds. 

HLUNG20083480//Chicken mRNA for TSC-22 variant, complete cds, clone SLFEST52. 
KIDNE20027980//SAM domain (Sterile alpha motif) 

KIDNE20084730//Homo sapiens FH1/FH2 domain-containing protein FHOS (FHOS) mRNA, complete cds. 
KIDNE20T497807/NG28 [Mus musculus] 
55 KIDNE20154330//Rattus norvegicus mRNA for multi PDZ domain protein. 

KIDNE20170400//Leucine Rich Repeat// Leucine Rich Repeat// Leucine Rich Repeat// Protein kinase C terminal do- 
main// Rubredoxin 

K1DNE20189890//Homo sapiens mRNA for KARP-1 -binding protein 2 (KAB2), complete cds. 
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LIVER20010760//Homo sapiens C-type lectin-like receptor-1 mRNA, complete cds. 
LIVER20040740//RETINAL-BINDING PROTEIN (RALBP). 
MESAN20009090//Homo sapiens CEGP1 protein (CEGP1), mRNA 
MESAN20026870//PAN domain// TBC domain 
5 MESAN200901907/CEGP1 protein [Homo sapiens]. 

NT2NE20059680//Homo sapiens integrin cytoplasmic domain associated protein (lcap-1a) mRNA, complete cds. 
NT2NE20077270//Adenovirus EB1 55K protein / large t-antigen 

NT2NE20087850//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN CEX) (FRAGMENT). 
NT2NE20095230//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds. 
10 NT2NE20108420//KES1 PROTEIN. 

NT2NE20173970//Rattus norvegicus beta-catenin binding protein mRNA, complete cds. 
NT2NE20177210//Leishmania major partial ppgl gene for proteophosphoglycan. 
NT2RP70012830//CALPHOTIN. 

NT2RP70035110//Caenorhabditis elegans UNC-89 (unc-89) gene, complete cds. 
15 NTONG20002230//Mus musculus RW1 protein mRNA, complete cds. 

NTONG20005310//Ribosomal protein S9/S16 

NTONG20029850//CALCYPHOSINE (R2D5 ANTIGEN). 

NTONG20035150//RING CANAL PROTEIN (KELCH PROTEIN). 

NTONG20058220//Homo sapiens phosphoprotein pp75 mRNA, partial cds. 
20 OCBBF20005220//Rattus norvegicus Fos-related antigen mRNA, complete cds. 

OCBBF20011860//Mus musculus epithelial protein lost in neoplasm-a (Eplin) mRNA, complete cds. 

OCBBF20016810//enhancer of polycomb [Mus musculus] 

OCBBF20147070//DNA polymerase (viral) C-terminal domain 

OCBBF20160380//liver stage antigen LSA-1 - Plasmodium falciparum OCBBF201 77910//Corticotropin-releasing fac- 
25 tor family 

PEBLM20005020//Virion host shutoff protein 

PLACE60055460//Homo sapiens leucine-zipper protein FKSG13 (FKSG13) mRNA, complete cds. 

PLACE6006871 0//SUPPRE PROTEIN SRP40. 

PLACE60080360//mucin [Homo sapiens] 
30 PLACE60082850//Pathogenesis-related protein Bet v I family 

PLACE60098350//Human hepatocellular carcinoma associated protein (JCL-1) mRNA, complete cds. 

PLACE601 05680//Homo sapiens mRNA for TU1 2B1 -TY, complete cds. 

PLACE601 1 97007/Homo sapiens mRNA for ABP32, complete cds. 

PLACE601 20280//SER/THR-RICH PROTEIN T10 IN DGCR REGION. 
35 PLACE60132200/yTRICHOHYALIN. 

PLACE60181870//Pentaxin family 

PROST20084470//Plasmodium berghei strain NYU2 merozoite surface protein-1 mRNA, partial cds. 
PROST20087240//gag gene protein p24 (core nucleocapsid protein) 
PROST20122490//Gallus gallus syndesmos mRNA, complete cds. 
40 PROST20130320//S-1 00/ICaBP type calcium binding domain 
PROST20152870//Homo sapiens APC2 gene, exon 14. 

PUAEN10001640//Mus musculus cerebellar postnatal development protein-1 (Cpd1) mRNA, partial cds. 

PUAEN20000800//Bleomycin resistance protein 

SMINT20012220//Collagen triple helix repeat (20 copies) 
45 SMINT20035510//Drosophila melanogaster La related protein (larp) mRNA, partial cds. 

SMINT20036440//Drosophila melanogaster epsin-like protein mRNA, complete cds. 

SMINT20038660//Homo sapiens HNOEL-iso (HNOEL-iso) mRNA, complete cds. 

SMINT20043390//Ras association (RalGDS/AF-6) domain 

SMINT20048720//Cytochrome P450// Cytochrome P450 
50 SMINT20052130//Rattus norvegicus mRNA for gankyrin homologue, complete cds. 

SMINT20054050//ABC1 PROTEIN HOMOLOG PRECURSOR. 

SPLEN20024770//Rattus norvegicus (rsec6) mRNA, complete cds. 

SPLEN20040780//CORNIFIN B (SMALL PROLINE-RICH PROTEIN 1B) (SPR1B) (SPR1 B). 
SPLEN20041810//BC-2 protein [Homo sapiens] 
55 SPLEN20100040//258.1 KDA PROTEIN C210RF5 (KIAA0933). 
SPLEN20104150//Ribosomal protein L36 
SPLEN20116720//Homo sapiens misato mRNA, partial cds. 
SPLEN20176130//Homo sapiens mRNA for ALEX1 , complete cds. 
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SPLEN201 81 570//TRICHOHYALIN. 
TESTI20004310//TRICHOHYALIN. 
TESTI20016970//TPR Domain 
TESTI20030440//TRICHOHYALIN. 
5 TESTI20043180//mouse mRNA for megakaryocyte potentiating factor, complete cds. 

TESTI2004391 0//IQ calmodulin-binding motif// IQ calmodulin-binding motif// IQ calmodulin-binding motif// IQ calmod- 
ulin-binding motif// IQ calmodulin-binding motif 

TESTI20044900//Strongylocentrotus purpuratus radial spokehead mRNA, complete cds. 
TESTI20646110//Extracellular link domain 
io TESTI20047930//Homo sapiens NY-REN-2 antigen mRNA, complete cds. 
TESTI20049410//Proprotein convertase P-domain 
TESTI20053950//IQ calmodulin-binding motif 

TESTI20054700//Streptbcoccus pneumoniae strain g375 surface protein PspC (pspC) gene, pspC-8. 1 allele, complete 
cds. 

15 TESTI20055880//Serum amyloid A protein 

TESTI20056030//Homo sapiens 88-kDa Golgi protein (GM88) mRNA, complete cds. 
TESTI20061090//Keratin, high sulfur B2 protein 

TESTI20064370//TPR Domain// TPR Domain// TPR Domain// TPR Domain// Synaptobrevin 
TESTI20084250//OXYSTEROL-BINDING PROTEIN. 
20 TESTI200921 70//ENV polyprotein (coat polyprotein) 
TESTI20116050//UBX domain 
TESTI20120500//Kelch motif// Kelch motif 

TESTI20126280//Mus musculus STAP mRNA for sperm tail associated protein, complete cds. 
TESTI20144390//TESTIS-SPECIFIC PROTEIN PBS13. 
25 TESTI201 659907/Ribosomal protein L36 

TESTI201 69500//HYPOTHETICAL 51 .9 KDA PROTEIN C27F1 .04C IN CHROMOSOME I. 
TESTI20170280//Flagellar L-ring protein 

TESTI20176450/Ahioredoxin interacting factor [Mus musculus]. 

TESTI20179230//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
30 TESTI201 806007/Homo sapiens HOM-TES-85 tumor antigen mRNA, complete cds. 

TESTI20209050//HYPOTHETICAL 113.1 KDA PROTEIN IN PRE5-FET4 INTERGENIC REGION. 

TESTI20210570//RETINAL-BINDING PROTEIN (RALBP). 

TE STI 2021 531 07/Homo sapiens calcyclin binding protein mRNA, complete cds. 

TESTI20247440//Human BLu protein testis isoform (BLu) mRNA, complete cds. 
35 TESTI20249360//Homo sapiens DEME-6 mRNA, partial cds. 

TESTI20250220//TRICHOHYALIN. 

TESTI20251 440//Rattus norvegicus (rsec6) mRNA, complete cds. 
TESTI20255460//Mus musculus mRNA for MIWI (piwi), complete cds. 
THYMU20009500//TPR Domain 
40 THYMU20010180//MOB1 PROTEIN (MPS1 BINDER 1). 

THYMU20013810//Human SEC7 homolog Tic (TIC) mRNA, complete cds. 
THYMU20018250//TPR Domain 

THYMU200269507/Mus musculus ROSA 26 transcription AS ROSA26AS mRNA, complete cds. 
THYMU20028410//Mus musculus Pax transcription activation domain interacting protein PTIP mRNA, complete cds. 
45 THYMU20030460//Homo sapiens tumor endothelial marker 7 precursor (TEM7) mRNA, complete cds. 
THYMU20031330//Homo sapiens putative nucleotide binding protein mRNA, complete cds. 
THYMU20052460//PHORBOLIN I (FRAGMENTS). 
THYMU20055450//Zona pellucida-like domain 

THYMU20083830//Homo sapiens angiostatin binding protein 1 mRNA, complete cds. 
50 THYMU201391 60//Uncharacterized protein family UPF0031 

THYMU20151610//Homo sapiens antigen NY-CO-1 (NY-CO-1) mRNA, complete cds. 
TRACH20093400//TRICHOHYALIN. 

TRACH20104510//Uncharacterized protein family UPF0005 
TR ACH20 1 22980//H Y POTH ETi C AL PROTEIN MJ0798. 
55 TRACH201 39280//PX domain 

TRACH20164810//D-isomer specific 2-hydroxyacid dehydrogenases 

TRACH20165540//Human alpha-1 type I collagen gene surrounding osteogenesis imperfecta 01 type II deletion. 
UTERU20051790//guanylate kinase-interacting protein 1 Maguin-1, membrane-associated - rat 
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UTERU20083020//Domain of unknown function DUF71 
UTERU201 21 1 40//Rhodanese-like domain 

UTERU20128560//26.4 KDA PROTEIN IN RUVC-ASPS INTERGENIC REGION. 
UTERU20132620//AXONEME-ASSOCIATED PROTEIN MST101(2). 
5 UTERU20134830//pellino (Drosophila) homolog 2 [Homo sapiens] 
UTERU201 81 270//Zinc knuckle 

[0091] Further, the reason is that a polypeptide does not always belong solely to a single category of the above- 
described functional categories, and therefore, a polypeptide may belong to any of the predicted functional categories. 
Besides, additional functions can be found for the clones classified into these functional categories by further analyses. 
w [0092] Since the polypeptide encoded by clones of the invention contains fuli-length amino acid sequence, it is pos- 
sible to analyze its biological activity, and its effect on cellular conditions such as cell proliferation and differentiation 
by expressing the polypeptide as a recombinant polypeptide using an appropriate expression system, injecting the 
recombinant into the cell, or raising a specific antibody against the polypeptide. 

[0093] The biological activities of respective polypeptides can be analyzed by the methods as shown below. 

15 

Secretory protein, transmembrane protein: 
[0094] 

20 "ion Channels" (Ed., R. H. Ashley, 1995) of "The Practical Approach Series" (IRL PRESS), 

"Growth Factors" (Eds., I. McKay, I. Leigh, 1993), 
"Extracellular Matrix" (Eds., M. A. Haralson, J. R. Hassell, 1995); 

Glycoprotein-related protein: 

25 

[0095] 

"Glycobiology" (Eds., M. Fukuda, A. Kobata, 1993) of "The Practical Approach Series*' (IRL PRESS), 
"Glycoprotein Analysis in Biomedicine" (Ed., Elizabeth F.Hounsell, 1993) of "Method in Molecular Biology" (Hu- 
so mana Press) series; 

Signal transduction-related protein: 

[0096] 

35 

"Signal Transduction" (Ed., G. Milligan, 1992) of "The Practical Approach Series" (IRL PRESS), 
"Protein Phosphorylation" (Ed., D. G. Hardie, 1993), or 

"Signal Transduction Protocols" (Eds., David A. Kendall, Stephen J.Hill, 1995) of "Method in Molecular Biology" 
(Humana Press) series; 

40 

Transcript ion -related protein: 
[0097] 

45 "Gene Transcription" (Eds., B. D. Hames, S. J. Higgins, 1993) of "The Practical Approach Series" (IRL PRESS), 

"Transcription Factors" (Ed., D.S.Latchman, 1993); Enzyme and/or metabolism-related protein: 
"Enzyme Assays" (Eds., ROBERT EiSENTHAL and MICHAEL J. DANSON, 1992) of "The Practical Approach 
Series" (IRL PRESS); Cell division and/or cell proliferation-related protein: 

"Cell Growth, Differentiation and Senescence" (Ed., GEORGE STUDZINSKI, 2000) of The Practical Approach 
50 Series" (IRL PRESS); 

Cytoskeleton -related protein: 

[0098] 

55 

"Cytoskeleton: Signalling and Cell Regulation" (Eds., KERMIT L. CARRAWAY and CAROLIE A. CAROTHERS 
CARRAWAY, 2000) of "The Practical Approach Series" (IRL PRESS), 

"Cytoskeleton Methods and Protocols" (Ed., Gavin, Ray H., 2000) of "Method in Molecular Biology" (Humana 
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Press) series; Nuclear protein and/or RNA synthesis-related protein: 

"Nuclear Receptors" (Ed., DIDIER PICARD, 1999) of 'The Practical Approach Series" (IRL PRESS), 
"RNA Processing" (Eds., STEPHEN J. HIGGINS and B. DAVID HAMES, 1994); 

5 Protein synthesis and/or transport-related protein: 

[0099] 

"Membrane Transport" (Ed., STEPHEN A. BALDWIN, 2000) of "The Practical Approach Series" (IRL PRESS) 
10 "Protein Synthesis Methods and Protocols" (Eds., Martin, Robin, 1 998) of "Method in Molecular Biology" (Humana 

Press) series; 

Cellular defense-related protein: 

15 [0100] 

"DNA Repair Protocols" (Henderson, Daryl S., 1999) of "Method in Molecular Biology" (Humana Press) series, 
"Chaperonin Protocols" (Eds., Schneider, Christine, 2000); Development and/or differentiation-related protein: 
"Developmental Biology Protocols" (Eds., ROBERT EISENTHAL and MICHAEL J. DANSON, 1992) of "Method in 
20 Molecular Biology" (Humana Press) series; 

DNA- and/or RNA-binding protein: 

[0101] 

25 

"DNA-Protein Interactions Principles and Protocols" (Eds., Kneale, G. Geoff, 1994) of "Method in Molecular Biol- 
ogy" (Humana Press) series, 

"RNA-Protein Interaction Protocols" (Eds., Haynes, Susan R., 1999); 
30 ATP- and/or GTP-binding protein: 
[0102] 

"Signal Transduction Protocols" (Eds., David A. Kendall, Stephen J. Hill, 1995) of "Method in Molecular Biology" 
35 (Humana Press) series. 

[0103] In the categorization, the clone predicted to belong to the category of secretory and/or membrane protein 
means a clone having hit data with some annotation, such as growth factor, cytokine, hormone, signal, transmembrane, 
membrane, extracellular matrix, receptor, G-protein coupled receptor, ionic channel, voltage-gated channel, calcium 
40 channel, cell adhesion, collagen, connective tissue, etc., suggesting that it was a secretory or membrane protein, or 
a clone in which the presence of nucleotide sequence encoding a signal sequence or transmembrane region was 
suggested by the results of PSORT and SOSUl analyses for deduced ORF. 

[0104] The clone predicted to belong to the category of glycoprotein-related protein means a clone having hit data 
with some annotation, such as glycoprotein, suggesting that the clone encodes a glycoprotein-related protein. 
45 [01 05] The clone predicted to belong to the category of signal transduction-related protein means a clone having hit 
data with some annotation, such as serine/threonine-protein kinase, tyrosine-protein kinase, SH3 domain, SH2 domain, 
etc., suggesting that the clone encodes a signal transduction-related protein. 

[0106] The clone predicted to belong to the category of transcription-related protein means a clone having hit data 
with some annotation, such as transcription regulation, zinc finger, homeobox, etc., suggesting that the clone encodes 

50 a transcription- related protein. 

[0107] The clone predicted to belong to the category of disease- related protein means a clone having hit data with 
some annotation, such as disease mutation, syndrome, etc., suggesting that the clone encodes a disease-related 
protein, or a clone whose full-length nucleotide sequence has hit data for Swiss-Prot, GenBank, UniGene, or nr, where 
the hit data corresponds to genes or polypeptides which have been deposited in the Online Mendelian Inheritance in 

55 Man (OMIM) (http://www.ncbi.nlm.nih.gov/Omim/), which is the human gene and disease database described later. 
[0108] The clone predicted to belong to the category of enzyme and/or metabolism-related protein means a clone 
having hit data with some annotation, such as metabolism, oxidoreductase, E. C. No. (Enzyme commission number), 
etc., suggesting that the clone encodes an enzyme and/or metabolism-related protein. 
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[0109] The clone predicted to belong to the category of cell division and/or cell proliferation-related protein means 
a clone having hit data with some annotation, such as cell division, cell cycle, mitosis, chromosomal protein, cell growth, 
apoptosis, etc., suggesting that the clone encodes a cell division and/or cell pro I iteration -related protein. 
[0110] The clone predicted to belong to the category of cytoskeleton -related protein means a clone having hit data 
5 with some annotation, such as structural protein, cytoskeleton, actin-binding, microtubles, etc., suggesting that the 
clone encodes a cytoskeleton -related protein. 

[0111] The clone predicted to belong to the category of nuclear protein and/or RNA synthesis-related protein means 
a clone having hit data with some annotation, such as nuclear protein, RNA splicing, RNA processing, RNA helicase, 
polyadenylation, etc. : suggesting that the clone encodes a nuclear protein and/or RNA synthesis-related protein. 
10 [0112] The clone predicted to belong to the category of protein synthesis and/or transport-related protein means a 
clone having hit data with some annotation, such as translation regulation, protein biosynthesis, amino-acid biosyn- 
thesis, ribosomal protein, protein transport, signal recognition particle, etc., suggesting that the clone encodes a protein 
synthesis and/or transport- related protein. 

[0113] The clone predicted to belong to the category of cellular defense- related protein means a clone having hit 
15 data with some annotation, such as heat shock, DNA repair, DNA damage, etc., suggesting that the clone encodes a 
cellular defense-related protein. 

[0114] The clone predicted to belong to the category of development and/or differentiation-related proteins means 
a clone having hit data with some annotation, such as developmental protein, etc., suggesting that the clone encodes 
a development and/or differentiation-related protein. 
20 [0115] The clone predicted to belong to the category of DNA-and/or RNA-binding protein means a clone having hit 
data with some annotation, such as DNA-binding, RNA-binding, etc. 

[0116] The clone predicted to belong to the category of ATP-and/or GTP-binding protein means a clone having hit 
data with some annotation, such as ATP-binding, GTP-binding, etc. 

[0117] As to a protein involved in a disease, it is possible to perform a functional analysis as described above, but 
25 also possible to analyze correlation between the expression or the activity of the protein and a certain disease by using 
a specific antibody that is obtained by using expressed protein. Alternatively, it is possible to utilize the database OMIM, 
which is a database of human genes and diseases, to analyze the protein. Further, new information is constantly being 
deposited in the OMIM database. Therefore, it is possible for one skilled in the art to find a new relationship between 
a particular disease and a gene of the present invention in the most up-to-date database. The proteins involved in 
30 diseases are useful for developing a diagnostic marker or medicines for regulation of their expression and activity, or 
as a target of gene therapy. 

[0118] Also, as for a secretory protein, membrane protein, signal transduction-related protein, glycoprotein -related 
protein, or transcription-related protein, etc., search of the OMIM with the following keywords resulted in the finding 
that the proteins are involved in many diseases (the result of the OMIM search for secrete and membrane proteins is 

35 shown below). Also, association between proteins related to signal transduction or transcription and diseases is re- 
ported in "Transcription Factor Research-1 999" (Fujii, Tamura, Morohashi, Kageyama, and Satake edit, (1999) Jikken- 
Igaku Zoukan, Vol.17, No.3), and "Gene Medicine" (1999) Vol.3, No. 2). When cancer is used as an example, as de- 
scribed in "Biology of Cancer" (S. Matsubara, 1992) of Life Science series (Shokabo), many proteins are involved in 
cancers, which include enzyme and/or metabolism-related proteins, cytoskeleton-related proteins, cell division and/or 

40 cell proliferation-related proteins as well as secretory proteins, membrane proteins, signal transduction-related proteins, 
glycoprotein-related proteins, transcription-related proteins. As clearly seen by the above example, it is evident that 
not only disease-related proteins but also secretory proteins, membrane proteins, signal transduction-related proteins, 
glycoprotein-related proteins, transcription -related proteins, etc. are often involved in diseases, and thus they can be 
useful targets in the field of medical industry. 

45 [0119] The result of the OMIM search for secretory and membrane proteins is shown below, in which the keywords, 

(1) secretion protein, 

(2) membrane protein, 

(3) channel, and 

50 (4) extracellular matrix were used. 

[0120] Shown in the search result are only the accession numbers in the OMIM. Using the number, data showing 
the relationship between a disease and a gene or protein can be seen. The OMIM data has been renewed everyday. 

55 1) Secretion protein 

354 entries found, searching for "secretion protein" 

*604667, *104760, *176860, *151675, *139320, *107400, *604029, *118910, #200100, M76880, *603850, 
*1 47572, *604028, M79513, *1 25950, M 39250, *246700, *600946, *600560, *602926, 1 85860, *605083, *603215, 
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'602421, '157147, '179512, '600174, '109270, '604710, '138120, '179510, '600998, '179509, '170280, 
'179511, '600626, '603831 , '601489, *1 54545, * 179490, 

'603826, '122559, '603216, '102720, '147290, '164160, '603062, *112262, '602672, '605435, '605322, 
'131230, '601652, '603166, '601746, *601591, '179508, #160900, '104311, '600759, '147545, '167805, 
5 #104300, '167770, #219700, '168470, '601684, '602049, '601146, '605227, *602434, '602534, '114840, 

*603489, '604323, '107470, '600753, '600768, '118825, '600564, 

'604252, '173120, '134370, '192340, '308230, '600322, '605359, '600046, '300090, 106160, '600041 , 
#262500, '605563, '150390, '158106, '182590, #103580, '104610, #173900, '134797, '143890, #145980, 
'306900, '308700, '176300, '227500, '137350, #154700, '138079, '600760, '107730, '142410, '147670, 

10 '124092, '590050, '152760, '600509, '605646, '201910, '227600, 

•152790, *300200,*300300, 300800, *1 381 60. *1 07741, M 201 50, *6011 99, *1201 80, M201 60, *176730,*1 331 70, 
'122560, '107300, '137241, '120140, '101000, *193400, '217000, '272800, '600937, #201710, *600377, 
#174800, '106100, #274600, '173350, #177170, '147620, '214500, '131244, '202110, '120120, '601007, 
'191160, '147470, '603372, '600733, '252800, '190160, 

15 '138040, '158070, '162151, #125700, #130070, '113811, '603355, '171060, '136435, #184700, '603732, 

'190180, '164008, '186590, '120220, '604312, '152200, '138130, '605085, '605353, '600840, #166210, 
'188545, '207750, '173360, '601933, #194050, '153450, '138850, '253200, '307030, '157145, '600514, 
'600262, '264080, *t 47380, '600281 , #204000, #227810, '232200, 

'188826, '232800, '161561, #166200, '188400, '153620, '182099, '218040, #265800, '172400, #177200, 
20 * 176805, #211600, #214700, #176410, *1 52780, '600633, '601771, '301500, '605402, '601922, '307800, 

*1 47892, '147720, '312060, #520000, '147660, '106150, '602358, '107270, '601769, '147440, *604558, 
'131530, '600270, '601610, '603692, '603401 , '600423, '601604, 

'603345, #125853, '602843, '142640, '603044, '605740, '134830, '602779, '130660, '139191, '137035, 
'600761, '601340, '600823, '107740, '130160, '600877, '605110, '600945, '130080, '600957, #130050, 
25 '605580, *1 1 8444, '601 1 24, *1 24020, 1 22470, *1 20700, '603201 , *1 3721 6, '601 1 85, *1 38945, *21 8030, '600839, 

#240600, #262400, #162300, '162330, '188450, #265850, 

'263200, '162641, '300159, '601038, #191390, '201810, '601398, '602384, '131240, '602423, '139392, 
'142703, '602663, '232700, '602682, #602722, '602730, '600734, '188540, '182452, '601538, '60306.1, 
'146880, '603140, '603160, '142704, #252650, '182280, '125255, '603252, #131750, '182139, '182100, 

30 #259420, #261100, '603493, '601745, '182098, '603795, '123812, 

'600264, '147940, '180246, '180245, '118888, #604284, '168450, '118455, '604398, '604433, '601919, 
'118445, '600031, '604961, '605032, '605033, '171050, #171300, '131243, '109160, '605254, 274900, 
#171400, '600042, '151670, '184600, '605470, '605546, '176760, '602008, '102200, '605720, '600732, 
'605901 

35 2) Membrane protein 

1489 entries found, searching for "membrane protein" 

'130500, '605704, '305360, '153330, '173610, '109270, '170995, '170993, '104776, '602333, '309060, 
'605703, '120920, '605943, '602690, '159430, '600897, '133090, '601178, '602413, '602003, '604405, 
'605940, '603237, '109280, '600378, '602173, '107776, '602334, '602335, '125305, '601134, '309845, 

40 '605731 , '154045, '603241 , '603718, '600594, '603214, '185881 , 

'603657, '600182, '603177, '605331, '601476, '605456, '601114, '605190, '600723, '603904, '136950, 
'300222, '602879, '185880, '605348, '300096, '602257, '177070, '310200, '603062, '603344, '600039, 
'602977, '300100, '128240, '600959, '600322, '227400, '186945, '600946, '602534, '602048, '182900, 
'601097, '600267, '602625, '136430, '602421 , '601 047, '107450, 

45 *143450, '603141, '184756, '164730, '159440, '154050, '600579, '312080, '604202, '603700, '600447, 

'256540, '604691, '158343, '600403, '602414, '137290, '176640, '176981, '600179, '600754, '604456, 
'604693, '605875, '604605, '188860, '300172, '602910, '604323, '219800, '601848, '603179, '600279, 
'602251, #222700, '603831, '605072, '605377, '601028, '604155, 

'108733, '104225, '601896, '601510, '173335, '107770, '601767, '600046, '603850, '600040, '603784, 
50 *603234, 1 88560, '605863, '121015, '605862, '605861 , *1 86946, '604252, '60321 5, *1 42461 , '604597, '6031 43, 

'605264, '603735, '176860, '605536, '176801, '180721, '603355, '104760, '131560, '310300, '602631, 
'304700, #309400, '603142, '143890, '605431 , '600753, 

'115501, '176790, '600266, '601691, '168468, '601239, '602216, #104300, '605613, '601595, '605550, 
'125950, '605475, '602217, '602261, '603534, '602262, '604631, '190315, '601313, '604306, '104311, 
55 *604672, '605000, '602461, '605548, '602296, '604376, '121014, '121011, '600691, '604262, '139310, 

'304040, '605445, '179514, '179512, '151460, #160900, '120130, 

'128239, '601 158, '601 403, '176943, '601 01 4, 300800, '300294, '601 757, *1 85470, '273800, '605034, '602887, 
#185000, '604871, '603593, '603583, '605454, '104775, '605872, '141180, '602713, '603531, '139150, 
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"601531 

"604699 

"300023 

"305100 

"604241 

"600481 

"604157 

*605058 

"600168 

"604721 

"605787 

185860, 

"602701 

"1 80380 

"131550, 

*1 55975 

*120131 

*605699, 

*1 62060, 

"605496, 

"605101 

* 192974 

"6041 96 

*604418, 

"600932, 

"604762 

*300242 

"602370 

*601275 

*602500 

*156225 

•113811, 

•179605 

*1 10900, 

•601567 

•601383 

•603967 

•604235 

•152790 

•173350 

•189980 

•190000, 

•604477 

#269920 

•604889 

•602924 

•603434 

#145500 

•603728, 

•130130, 

•605245, 

•312610, 

•173510 

•307800 

•602122 

•146710 

•601843 

•601193, 



•601832, *605452, *134651, "604156, *120620, "605883, *604142, *166945, "605324, "600816, 
•300112, "605182, *600164, *182180, *605071 , 

•605057, *308240, "300249, *1 76947, *1 76894, *605081 , "605035, "602044, *1 82860, *1 07271, 
•153390, *113730, *602689, *180069, *603518, *300017, "191275, *177061, *601693, *601789 ! 
•600934, *1 381 60, "604424, *603868, *600174, *600718, *600523, *604141, *601009, "605251, 
•600874, *155550, "605227, "601017, *162230, 601138, 

•601212, *600763, *604110, *604158, "601107, "601326, 600621 , "600587, 601137, "600917, "600855, 
•194355, *605194, "603291, "102720, "136425, "170715, "603216, "605547, "135630, "602926, 
"605002, "602474, "600157, "603025, "603893, "231200, "120090, "601966, "131230, "604722, 

•604515, *246700, "602101 , "605628, "303630, 

"602857, "602285, "605708, "602488, "605025, "60381 7, "300051 , "603293, "1 76878, "603646, 605707, 
112205, "300187, "602654, "120070, "603648, "604850, "602655, "602514, "300118, "182309, "179590, 
"600759, "204200, "604170, "175100, #103580, "147670, "306400, "143100, "182870, "257220, 
#116920, "301000, "193300, "157147, 

•139200, "139130, "190195, "605406, "155760, "155960, "605734, "155970, "605385, "111700, 
•150370, 605709, *151430, *605438, *15151 0, *116952, *157655, *1581 05, *605777, *1 76877, *1 5361 9, 

•185430, *109190, *120190, *109170, *605093, *605250, "153432, "107777, "186590, "160993, 
•605698, *605813, *605697, *605616, *605300, 

•605219, *1 63970, *1 35620, *1 65040, "605478, "604964, "103195, "604932, "604923, "605906, 
"60591 4, "1 66490, 1 38277, "60491 5, "1 1 4070, "60521 3, "605933, "1 80297, "101 000, "1 91 1 63, "1 91 1 64, 

"603167, "600772, "603164, "600708, "604001, "191328, "313440, "602672, "604009, "604299, 
"604256, "603048, "600515, "604221, "602632, 

*601 1 79, 603290, *604661 , *601 023, *601 110, *304800, *203200, *30021 2 , "602933, "603352, "208900, 
"604838, "600551, #212140, "604837, "602049, "600552, "600553, "300213, "602574, "600583, 
"603452, "604775, "516020, "604617, "604464, "603498, "300145, "601523, "602694, "600632, 

"604492, "400015, "604504, "601717, "601728, 

"602426, "604194, "603821, "604730, "600695, "603823, "603869, "300241, "600707, "603822, 
"602202, "604193, "601181, "604089, "602507, "604195, "602306, "300284, "601805, "601895, 
•604660, "600752, "603820, "604192, "602207, "308230, "600894, "312600, "603199, "604029, 
"102680, "235200, #256300, "601633, #219700, 262890, 

•173470, "193400, *173910, *600354, '113705, "600065, "107741, "107400, "600024, "131195, 
#118220, "601638, "300011, "276903, "604144, *311770, "601758, #173900, "604592, "120120, 
•603130, *603372, "110750, "222900, "602509, "256100, "602469, "602281, "229300, "224100, 
"190180, "261600, "602997, "603616, "603189, 601791, 

"312700, "171060, "308700, "604027, "162643, "516000, "176261, "604028, "314850, #145980, 
•600930, *305900, "601253, *1 36350, *605537, *138140, *604033, *605070, *1 39250, "300500, 
"300041, "603866, #130600, "120150, "601050, "604942, "605204, "605248, "272750, "600163, 
"600682, "107266, "306900, "191092, #262500, "600106, 

•186720, *227650, "153700, *308380, "103390, "605646, "164920, "604478, #252650, "173850, 
•602505, *246530, "194380, *602575, *603030, #209920, *212138, #214100, *605767, *600582, 
#176200, *604653, *604678, *256550, *300037, *253700, #253300, #226700, "604766, #244400, 
"188040, "604824, "214500, #237500, "232300, "605014, 

"190930, "605124, "604475, "604594, #227810, "306700, #301050, "600135, "600143, "605145, 
, "300104, "277900, "300135, "300231, "192500, "182138, "191190, "176805, "600185, "186591, 

"603051, *165360, *147545, *601040, #156575, *107269, "603009, *602934, "123825, "601081, 

•163890, *600381, *602909, "150330, *109690, "123900, 

•603491, *110700, "602581, "125647, #154700, "114760, 
, "601309, "602382, "120325, "600877, "604205, *604090, 

•120950, '604026, *600580, *601610, *137167, *603960, 

•601997, "601975, "600395, "516040, "600418, "600650, 

"605172, "600509, "164761, "310400, "600308, "605109, "600544, "600359, "600103, "605267, 
•176100, *308100, *158070, *605123, "173325, #312750, "600839, *158120, #604369, *604465, 
#161200, *151525, *605369, *604237, "516050, #600886, *604517, "165180, *605381, "605399, 
"604365, "155740, "147795, 601709, "604673, "147730, 

•147557, "193245, "600978, "604990, "603261, "603274, *601007, *131100, *602941 , "107941, 
•276901, *131244, *602872, *603411, *186357, "176290, *601066, *185050, *232200, "143030, 
#236700, "604122, "142800, "134638, "604985, "182380, "603930, "142410, "137060, "604586, 
"120650, "252500, "253800, "120930, "604858, "605874, 



•141900, "603690, "120220, *601199, 
•601497, *602377, *605464, *1 38720, 
•603931, *601880, *603126, "138190, 
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601274, '602158, '605873, '193210, '203100, '601295, '604095, #201710, '126150, '108740, #205400, 
'601373, '300167, '109545, '602894, '603361, #300257, '266200, '603401, '131390, '180470, '605908, 
'604798, #221770, '223360, *1 80901, '605641, '605745, '604018, '300200, '604603, '230800, '602676, 
#604004, '605692, '602640, '601599, '134637, '245900, '118425, 

601614, '605725, '120110, '300189, '300035, '603102, '250800, '602282, '602458, '12361 0, '603754, '300278, 
*601463, '300224, *601581, '182160, '601653, *139191, '601733, '600748, *142460, '601194, '152390, 
*153620, '601615, *601814, '601617, '601613, '300191, #308300, '600798, 601858, '601872, *601597, 
#601588, *600821 , *1 47840, *1 52427, *1 38850, '600823, 

'601492, '300256, '600840, '300267, '601411 , '139080, '139090, 600851 , '300334, *1 79080, '602095, '601284, 
'601282, #177200, '601681, '601252, '176000, '602184, '602188, #266510, #154020, '186711, '257200, 
'601711, '600667, '602241, '186745, '255125, '300126, '600644, '123890, #255120, #175200, '600004, 
'302060, '123580, '186760, '122561 , '602316, '600017, 

*1 20940, 1 40300, '151 690, *1 20700, '602354, '60001 9, '600857, *1 821 75, '600536, *1 58380, '60051 6, *1 20290, 
'600493, '182310, #252010, '182530, '186830, '601839, '142790, '159465, '118990, '250790, '248600, 
#248250, '186845, '601153, '142600, '116930, '114860, '171834, #303600, *1 86880, '600444, '142871, 
'601852, '602602, '602607, '114207, '186910, #232220, 

600880, '134635, '112203, #1121 00, '111680, '231 680, '311 030, '111 250, '111200, '134390, #226670, #145600, 
'226200, '602714, '171760, '133550, '602727, '161555, '602744, '602746, #131705, '602835, '600423, 
'176267, '602859, #600918, 277175, '602874, '601020, '109770, '600170, '217070, '173515, '602893, 
'147280, '154360, '171050, '108780, '176257, '600979, 

'600377, '108360, '204500, '170260, '146880, '154582, '601011, '600997, '602992, '201475, '603005, 
'190198, '147360, #270400, '600238, #164970, '306250, #126600, '193065, #181350, '106180, '602136, 
'600937, '603086, '603087, '307030, '182099, '103320, '601683, #192430, '103180, '102681, '192321, 
'600244, '191740, '191315, '603152, '102642, '191305, #266140, 

'100500, *600867, '604585, '604404, '604345, '603201, '605430, '603207, '603208, '605433, '604101, 
'603969, *605896, '604616, '605851, '605768, '604576, '605754, '605730, '605477, '603263, '605538, 
'603283, '604402, '605453, '605427, '603302, '605458, 603313, '604415, '603345, '605541 , '603353, '605295, 
'603879, '605268, '605266, '605246, '603377, '603380, 

'605181, *604203, '603425, '603867, '605106, '605017, '603842, '604936, '603510, '604857, '605932, 
'605816, *603765, '603551, '605357, '605237, '604204, '603594, '605110, '604190, '603861, '604962, 
'603639, '603644, '605007, '605349, '604943, '60491.8, '604907, '603667, '603681, '605396, '605561, 
'603712, '603713, '605688, '605942, '604878, '604843, '604659, 

'604671, '603798, '604682, '604056, '604705, '603749, 602586, '603647, '602515, #602475, '603717, 
'602359, '602372, '602380, '60251 8, '603652, '602573, '603626, 602587, '603598, '602871 , '60361 3, '603750, 
'603875, '602608, '602666, '602345, '602935, '603564, '603548, '603927, 601876, '602343, '603943, '603787, 
'601730, '601611, '602679, '603788, '602243, 

603790, '601535, '603796, '601488, '601485, '60231 4, '601478, '604047, '604048, '602297, '604057, '602715, 
'602192, '601459, '601416, '603833, '602190, '604102, '602106, '604111, '602724, '603499, '602736, 
'601123, '601002, '600923, '601987, '604149, '601929, '600910, '600900, '600864, '604165, '600782, 
'602836, '600769, '600742, '602783, '601905, '600535, 

'604198, '601901, '600534, '602876, '603356, '600530, '604216, '604217, '602890, '602905, '600465, 
'600464, '600446, '602891, '603366, '601894, '604272, '603926, '603312, '600368, '602914, '600327, 
'603151, '603202, 602911, '602974, '603006, '601883, '603008, '600074, '603007, '603046, #603903, 
'604433, '600016, '603925, '516005, '516004, '516003, '601756, 

'604487, '516001, '313475, '313470, #307810, '604527, '604528, '601745, '604551, '604555, '603243, 
'603242, '603061, '603063, '603217, '300335, '300283, '300281, '604600, '300197, '603097, '603220, 
'601625, '604623, '603118, *601590, '604646, '300008, '601568, '300007, '275630, '601533, #275200, 
'270200, #261550, '604031 , '604683, #254800, '251100, #242300, 

'604058, '604720, '240500, '233690, #232240, #226730, '223100, '222100, #220100, '216950, '604832, 
212750, 212067, '604066, '193067, 601315, '193001 , '604862, '604870, '191306, '600385, '604879, '191191 , 
'601296, '604914, '190181, '604119, #188550, '604925, '188410, #601287, '604939, '188380, '604126, 
'604945, '604148, '188060, '604982, '186854, '604988, 

*1 86360, *1 86355, *1 85250, '60091 6, '605008, '605009, 1 85020, '600734, '605024, *1 82331 , '605032, '605033, 
'182305, *1 80903, #179800, '179610, '605060, '179410, '178990, '176802, '605080, '176266, '176263, 
*1 76260, '600732, '1 73490, '6041 99, *1 73445, *1 73391 , 1 72290, '6051 47, '6051 49, *1 71 890, '600528, *1 71 833, 
'605185, #170500, '605193, #168000, '605196, 

'167055, '605205, '605208, 166900, '605216, '162651, '162010, '600504, #161400, '604253, #160800, 
'159460, '154540, '605254, '605261, '153634, '600429, '153337, '600424, '605292, #604286, #152700, 
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152423, '152310, '151625, '600153, *604313,*151523, '150325, '150320, *1 50292/6031 50, *1 50290, *1 50210, 
'605410, '605415, '605416, '605417, '605421 , '603149, 

'604349, '147940, *600282, M47880, *146928, *146661, '600150, '146630, '142622, '600018, '605461, 
•138981, '138590, *600023, '138330, '605495, M38297, *605512, *138230, #136900, #301310, '516006, 
5 '605545, *605546, M36131, '134660, *134350, *516002, *605589, *131235, #130050, *605625, '126455, 

*1 26064, #125310, '605670, '604534, *1 25240, *1 23836, *1 23830, 

'123620, '605702, #122200, '120980, '120360, '118510, '114835, *605710, *605716, *605722, *114217, 
•604561, *113810, '111740, #110800, *605748, *605752, *604564, *110600, *603160, *109610, *605784, 
#107480, *1 07273, *603192, *300169, *106195, *105210, *1 04615, '104614, *104210, *103850, 103581, 
10 *605876, *605877, *605879, '103220, *605887, *300150, *102910, 

•102670, -102576, *605916, *604629, *102575, * 102573, *300132, *101800, *605947 
3) Channel (member of membrane protein) 
361 entries found, searching for "channel" 

•176266, '600724, *182390, *123825, '114208, M14206, '176267, '114205, •601784, *600937, *114204, 
15 *603415, *600053, *114207, *114209, *605427, *604527, *604528, *600760, •601011, *192500, *118425, 

•600228, *176261, *602235, *600761, *600359, *300008, *182389, *600877, *602232, '176263, *182391, 
•601328, -600054, *603939, *602208, *601534, *600504, *602323, 

•603208, *601958, *603537, *601012, *601327, *600734, *602780, *602781 , *604433, *603220, *1 82392, 
•605874, *605873, *601745, *603888, *603219, *602604, *603796 s *302910, *602866, *601013, *602905, 
20 *602906, *603967, *600163, #170500, *152427, •180901, *176260, #601462, *603951, *601141, *604492, 

•600702, *602023, *600308, *602754, *1 07776, *1 76257, *602024, 

•601949, *605222, *601142, *602983, *193245, *600681 , *176265, *600235, *176262, *176258, *605206, 
•604427, *605411, *603305, *601219, -600150, *604065, *602343, *605223, *605720, *603906, *138249, 
•138253, *600843, *604385 : ^600003, *600935, *603940, *602727, *6021 58, 602911 , *600397, *602726, *600845, 

25 *605080, *600580 : *602872, *602106, *1 76264, *603953, 

•605722, *3001 1 0, * 1 38252, *6041 1 1 , *60271 7, *602420, *600570, 600844, *603493 , *600932, *60571 6, *1 38254, 
•603652, *300138, *605410, *1 76268, *605214, *605696, *300334, *604660, *176256, *605879, *603749, 
•603583, *602345, *604661 , *603787, 60331 3, *602982, *604337, *600846, *604662, *300328, *300281 , *602566, 
•602836, *604003, *603788, *603651, *602421 , 

30 *1 07777, #177200, *1 00725, #219700, *1 00690, *1 00710, #160800, #603830, #183086, *600509, #220400, 

#601144, *173910, M80902, *605692, #264350, #160900, #145600, #255700, *602076, *603061, *601313, 
•154275, #604233, -604532, #108500, #121201, #170400, *300225, *121014, *139311, #125800, #160120, 
•118503, 601439, #141500, #168300, *304040, #601887, #256450, 

•186945, *154276, #300009, #216900, ^600040, *601014, *601042, '602512, *601383, *605445, *602368, 
35 '603831, #117000, *601218, *1 08745, *605248, #177735, #173900, *601212, *182139, *601059, *600039, 

•601485, *1 80903, *1 86360, '603319, #600101, *1 18509, *600109, #121200, *600170, *604187, *1 76975, 
•137163, #310468, #263800, #262300, *603750, *600229, *1 24030, 

•602251, #603829, *1 37143, #145500, *600669, '147450, *1 54050, *603353, *600516, '601157, *600855, 
•601154, *602522, *249210, *600968, #252650, *171060, *600919, *1 56490, #259700, #601678, '601764, 
40 #310500, *131244, *300041 , '121011, '125950, *114180, '602974, '600637, '113730, *118504, *605145, 

•604669, *118800, M21013, '121015, '138491 , '600421 , '104610, 

'604045, *604594, '131230, '605487, '138247, *600467, #602485, '602481, *1 38251, *137192, *602403, 
600851, '277900, '603785, '603152, '603199, '603475, #168600, #272120, '170280, '603852, #241200, 
•603053, -600465, #603034, *142461, '164920, M37164, *600884, '600442, '123885, '604001, '600232, 

45 '232200, '171050, '602103, '602014, '300211 , '600983, '602887, 

'60441 5, '60441 8, '300242, #300071 , '604471 , '600837, 1 68350, *1 1 851 1 , 1 93007, *600300, *604654, #601 820, 
•180297, *600046, *603853, *604678, *604693, #604772, •118508, *603855, *605204, #254210, *182099, 
•182307, #130600, *601109, *114080, *300103, *182860, *605438, '601129, '603964, '600019, '516060, 
#185000, '138079, '104210, '605818, *603418, *305990, *305450 

so 4) Extracellular matrix 

21 8 entries found, searching for "extracellular matrix" 

•605912, *603479, *602201, '604633, '601418, '601548, '115437, '154870, '600754, '602261, '602285, 
'602262, '134797, '120361, '604629, '604871, *603321 , '603320, '601807, #154700, '116935, '185261, 
'120360, '185250, '605470, *603767, '253700, '190180, '128239, '308700, '276901, '193300, '120324, 
55 *1 88826, '602109, '155760, '600514, '600261 , #1 771 70, '600536, 

'147557, #116920, '150240, '601313, '120140, 601614, '605158, '120150, '120180, #200610, '605127, 
'193400, '192240, #173900, '152200, #136900, '135821, #130070, '120320, '120220, '112260, '310200, 
'600900, -600262, '605670, '600985, '179590, #245150, '602574, '601463, 183850, '601211, '604241, 
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*600758, *186745, *604710, *602369, *602090, *190182, *192975, 

*602178, *230740, *600065, *601652, M58106, M90181, M56790, #158810, *193210, *155120, *192977, 
*193065, #226700, *187380, *231050, M82120, *188060, *186355 : 163200, *164010, #156550, *151510, 
M50370, *253800, *156225, *150325, #194050, *150290, *216550, *147620, *600215, *222600, M47559, 

5 *165380, *182888, *600491 , M46650, M46640, *600564, *600596, 

*600616, *600700, *600742, *1 38297, *1 82889, *1 54705, *600930, *301870, *153619, *601050, *601090, 
*601105, *165070, *305370, *135820, M30660, *310300, *601492, *128240, *601587, #126600, *601636, 
*6001 1 9, *601 692, *601 728, *1 25485, 601 858, *601 915, *602048, *1 751 00, *6021 08, *1 21 01 0, *600245, *1 20470, 
*1 20328, *1 20325, *602264, *1 20280, *602366, *600309, 

10 *602402, *602415, *602428, *602453, *602505, #166210, *602600, *602941, *603005, *603196, 603209, 

*603221, *603234, *603319, *120250, *120210, *120120, *603489, *603551 , *118938, *603799, *603842, 
*603924, *603963, *604042, *604063, *604149, *604160, *601028, *604467, *604510, *604592, *1 16930, 
*1 16806, *601284, *604724, *604806, *604807, *604808, *1 07269, 

*605007, *605008, *605009, *600214, *600076, *605174, *605175, *605292, *605343, *605351, #600204, 
15 *605497, *605546, *605587 t *605623, *600211 , *605702, *1 03320 



[0121] In addition to these, the various keywords shown in the above-mentioned categorization or others can be 
used for the OMIM search and the result may suggest the involvement thereof in diseases. 

[01 22] Further, the use of nucleotide sequences of cDNAs of the present invention enables analyzing the expression 
20 frequency of genes corresponding to the cDNAs. In addition, functions of the genes can be predicted based on the 
information obtained by the expression frequency analysis. 

[0123] There are several methods for analyzing the expression levels of genes involved in diseases. Differences in 
gene expression levels between diseased and normal tissues are studied by the analytical methods using, for example, 
Northern hybridization, RT-PCR, DNA microarray, etc. (Experimental Medicine, Vol.17, No. 8, 980-1056 (1999); Cell 

25 Engineering (additional volume) DNA Microarray and Advanced PCR Methods, Muramatsu & Nawa (eds.), Shujunsya 
(2000)). By computer analysis, in addition to these analysis methods, the nucleotide sequences of expressed genes 
can be compared to analyze the expression frequency. For example, there is a database called "BODYMAP"; gene 
clones are extracted at random from cDNA libraries of various tissues and/or cells, and the clones homologous to one 
another are assigned to a single cluster based on the information of nucleotide sequence homology at the 3'-end; 

30 genes are classified into any clusters, and the numbers of clones in the respective clusters are compared to gain the 
information on expression frequency (http://bodymap.ims.u-tokyo.ac.jp/). 

[0124] When explicit difference in the expression levels between diseased tissues and normal tissues is observed 
for a gene by these analytical methods, it can be conclude that the gene is closely involved in a disease or disorder. 
Instead of diseased tissues, when gene expression is explicitly different between normal cells and cells reproducing 

35 disease-associated specific features, it can be concluded that the gene is closely involved in a disease or disorder. 
[0125] From the 1970 clones whose full-length nucleotide sequences had been revealed, genes involved in particular 
pathology or functions were selected by the use of databases shown below (see Example 7; "Expression frequency 
analysis in silico"). The database used in the analyses of the present invention contains nucleotide sequences of 
770,546 clones, and the population of the database is large enough for the analysis. The sequence information in the 

40 database was obtained by selecting cDNA clones at random from cDNA libraries derived from the various tissues and 
cells shown in Example 1 and determining the 5'-end sequences thereof. 

[0126] Then, the nucleotide sequences of respective clones in this database were categorized (clustered) based on 
the nucleotide sequence homology determined with a search program; the number of clones belonging to every cluster 
of each library was determined and normalized; thus, the ratio of a certain gene in a cDNA library was determined. 
45 This analysis provided the information of the expression frequency of a gene in a tissue or cell that is the source of the 
cDNA library. 

[0127] Then, in order to analyze the expression of genes corresponding to the nucleotide sequences of cDNAs of 
the present invention in tissues and cells, the libraries from the tissues or cells, which had been used in the large-scale 
cDNA analyses, were taken as subjects to compare the expression levels between different tissues or cells. Namely, 
50 the expression frequency was analyzed by comparing the previously normalized values between tissues or cells from 
which 600 or more cDNA clones whose nucleotide sequences had been analyzed were derived. The result of this 
analysis showed that the cDNA clones corresponded to the genes involved in the pathology and functions, which are 
indicated below. Each value in Tables 3 to 39 indicated below represents a relative expression frequency; the higher 
the value, the higher the expression level. 

55 

Osteoporosis-related genes 



[0128] Osteoporosis is a pathology in which bones are easily broken owing to overall decrease in components of 
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bone. The onset correlates to the balance between the functions of osteoblast producing bone and osteoclast absorbing 
bone, namely bone metabolism. Thus : the genes involved in the increase of osteoclasts differentiating from precursor 
cells of monocyte/macrophage line (Molecular Medicine 38. 642-648. (2001)) are genes involved in osteoporosis rel- 
evant to bone metabolism. 

5 [0129] A nucleotide sequence information-based analysis was carried out to identify the genes whose expression 
frequencies are higher or lower in CD34+ cell (cell expressing a glycoprotein CD34) treated with the osteoclast differ- 
entiation factor (Molecular Medicine 38. 642-648. (2001)) than in the untreated CD34+cell, which is the precursor cell 
of monocyte/macrophage line. The result of comparative analysis for the frequency between the cDNA libraries pre- 
pared from the RNA of CD34+ cells (CD34C) and from the RNA of CD34+ cells treated with the osteoclast differentiation 

10 factor (D30ST, D60ST or D90ST) showed that the genes whose expression levels were different between the two were 
26 clones indicated in Table 3. These clones are involved in osteoporosis. 

Genes involved in neural cell differentiation 

15 [0130] Genes involved in neural cell differentiation are useful for treating neurological diseases. Genes with varying 
expression levels in response to induction of cellular differentiation in neural cells are thought to be involved in neuro- 
logical diseases. 

[0131] A survey was performed for genes whose expression levels are varied in response to induction of differenti- 
ation (stimulation by retinoic acid (RA) or growth inhibitor treatment after RA stimulation) in cultured cells of a neural 
20 strain, NT2. The result of comparative analysis of cDNA libraries derived from undifferentiated NT2 cells (NT2RM) and 
the cells subjected to the differentiation treatment (NT2RP, NT2RI or NT2NE) showed that the genes whose expression 
levels were different between the two were 193 clones indicated in Table 4. These genes are neurological disease- 
related genes. 

25 Cancer-related genes 

[0132] It has been assumed that, distinct from normal tissues, cancer tissues express a distinct set of genes, and 
thus the expression thereof can contribute to the carcinogenesis in tissues and cells. Thus, genes whose expression 
patterns in cancer tissues are different from those in normal tissues are cancer-related genes. Search was carried out 
30 for the genes whose expression levels in cancer tissues were different from those in normal tissues. 

[0133] The result of comparative analysis of cDNA libraries derived from breast tumor (TBAES) and normal breast 
(BEAST) showed that the genes whose expression levels were different between the two were 4 clones indicated in 
Table 5. 

[0134] The result of comparative analysis of cDNA libraries derived cervical tumor (TCERX) and normal cervical duct 
35 (CERVX) showed that the genes whose expression levels were different between the two was one clone indicated in 
Table 6. 

[0135] The result of comparative analysis of cDNA libraries derived from colon tumor (TCOLN) and normal colon 
(COLON) showed that the genes whose expression levels were different between the two was one clone indicated in 
Table 7. 

*o [0136] The result of comparative analysis of cDNA libraries derived from esophageal tumor (TESOP) and normal 
esophagus (NESOP) showed that the genes whose expression levels were different between the two were 6 clones 
indicated in Table 8. 

[0137] The result of comparative analysis of cDNA libraries derived from kidney tumor (TKIDN) and normal kidney 
(KIDNE) showed that the genes whose expression levels were different between the two were 132 clones indicated 
^5 in Table 9. 

[0138] The result of comparative analysis of cDNA libraries derived from liver tumor (TLIVE) and normal liver (LIVER) 
showed that the genes whose expression levels were different between the two were 25 clones indicated in Table 10. 
[0139] The result of comparative analysis of cDNA libraries derived from lung tumor (TLUNG) and normal lung 
(HLUNG) showed that the genes whose expression levels were different between the two were 99 clones indicated in 
50 Table 11. 

[0140] The result of comparative analysis of cDNA libraries derived from ovary tumor (TOVER) and normal ovary 
(NOVER) showed the genes whose expression levels were different between the two were 3 clones indicated in Table 
12. 

[01 41 ] The result of comparative analysis of cDNA libraries derived from stomach tumor (TSTOM) and normal stom- 
as ach (STOMA) showed that the genes whose expression levels were different between the two were 1 5 clones indicated 
in Table 13. 

[0142] The result of comparative analysis of cDNA libraries derived from uterine tumor (TUTER) and normal uterus 
(UTERU) showed that the genes whose expression levels were different between the two were 97 clones indicated in 
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Table 14. 

[01 43] The result of comparative analysis of cDN A libraries derived from tongue cancer (CTONG) and normal tongue 
(NTONG) showed that the genes whose expression levels were different between the two were 203 clones indicated 
in Table 15. 

5 [0144] These genes are involved in cancers. 

[0145] Further, there is a method to search for genes involved in development and differentiation, which is the ex- 
pression frequency analysis in which the expression levels of genes are compared between developing and/or differ- 
entiating tissues and/or cells and adult tissues and/or cells. The genes involved in tissue development and/or differ- 
entiation are genes participating in tissue construction and expression of function, and thus are useful genes, which 

10 are available for regenerative medicine aiming at convenient regeneration of injured tissues. 

[0146] By using the information of gene expression frequency gained from the database of 5'-end nucleotide se- 
quences described above, genes involved in development or differentiation of particular tissues were selected from 
the 1970 clones whose full-length nucleotide sequence had been revealed (see Example 7). 

[0147] The result of comparative analysis of cDNA libraries derived from fetal brain (FCBBF, FEBRA or OCBBF) and 
15 adult brain (BRACE, BRALZ, BRAMY, BRAWH, BRCAN, BRCOC, BRHIP, BRSSN, BRSTN or BRTHA) showed that 

the genes whose expression levels were different between the two were 775 clones indicated in Tables 1 6 to 36. 

[01 48] The result of comparative analysis of cDN A libraries derived from fetal heart (FEHRT) and adu It heart (H E ART) 

showed that the genes whose expression levels were different between the two were 28 clones indicated in Table 37. 

[0149] The result of comparative analysis of cDNA libraries derived from fetal kidney (FEKID) and adult kidney 
20 (KIDNE) showed that the genes whose expression levels were different between the two were 95 clones indicated in 

Table 38. 

[01 50] The result of comparative analysis of cDN A libraries derived from fetal lung (FELNG) and adult lung (HLUNG) 
showed that the genes whose expression levels were different between the two were 99 clones indicated in Table 39. 
These genes are involved in regeneration of tissues and/or Cells. 

25 [01 51 ] The expression frequency or the like can be analyzed by PCR based on the nucleotide sequences of cDNAs 
of the present invention. There are some known methods for comparing the quantities of amplification products obtained 
by PCR. For example, the band intensities can be determined by ethidium bromide staining. With Rl-labeled or fluo- 
rescently labeled primers, the Rl signal or fluorescence intensity can be assayed for the quantity of labeled amplification 
products. Alternatively, the quantity of amplification products can also be determined by measuring the Rl signal or the 

30 fluorescence intensity from the Rl-labeled or fluorescently labeled probe hybridizing to the products. The assay results 
thus obtained are compared and then the clones exhibiting differences in the expression levels can be selected. 
[0152] There are some quantitative PCR methods: a PCR method using internal standards; a competitive PCR, in 
which the quantification is achieved by adding, to a sample, a dilution series of a known quantity of a template RNA 
and by comparing the quantity of an amplification product derived from the RNA of interest with the quantity of an 

35 amplification product derived from the template RNA. These methods overcome the problems of errors in the amount 
of amplification products among tubes and of the plateau effect. ATAC-PCR (Adaptor-tagged competitive PCR) is a 
method of competitive PCR which is practiced by using multiple adapters of different sizes attached to a gene whose 
3'-end nucleotide sequence has previously been determined. The ratio of expression frequency of a single mRNA 
species from a number of tissues (cells) can be assayed in a single step (Nucleic Acids Research 1997, 25(22): 

40 4694-4696; "DNA Micro-array and Advanced PCR Techniques", Cell Technology, supplement, Eds., Muramatsu and 
Nawa (Shujunsha, 2000): 104-112). 

[0153] If it is observed, by using these analytical methods, that the expression levels of genes are evidently varied 
during major cellular events (such as differentiation and apoptosis), the genes are involved in the cellular events and 
accordingly are candidates for disease- and/or disorder-related genes. Further, genes exhibiting tissue-specific ex- 
45 pression are genes playing important parts in the tissue functions and, therefore, can be candidates for genes involved 
in diseases and/or disorders affecting the tissues. 

[01 54] For example, inflammation is an important biological response that is known to be involved in various diseases. 
The representative inflammation-inducing factors include TNF-a (Tumor Necrosis Factor-alpha). There exists a sign- 
aling cascade activated by TNF-a stimulations, wherein NF-kB is a transducing molecule (Cell 1995, 80:529-532). It 
50 has also been revealed that many inflammation-related genes, including IL-2, IL-6 and G-CSF, are varied in the ex- 
pression levels thereof in response to the signal through the pathway (Trends Genet. 1999, 15(6): 229-235). It is 
assumed that genes whose expression levels are varied in response to the stimulation of TNF-a also participate in 
inflammation. 

[0155] Further, the infection of Helicobacter py tori to the gastric epithelia is known to cause gastritis and gastrodu- 
55 odenal ulcer (Mebio 2000, July, 17(7): 16-33). Thus, the genes whose expression levels are altered depending on co- 
culturing cells with Helicobacter pylori may be involved in gastritis and gastroduodenal ulcer. A recent study has sug- 
gested that Helicobacter pylori strongly activates the NF-kB pathway (Gastroenterology 2000, 119: 97-108). 
[0156] THP-1 cell, which is a human monocyte cell line, was cultured in the presence of TNF- a (Tumor Necrosis 
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Factor- alp ha). The genes whose expression levels were altered owing to the presence of TNF-a were searched for, 
and the result showed that the clones whose expression levels were increased owing to the presence of TNF-a were 
ASTRO20055530, ASTRO20055930, ASTRO20088950, BRAMY20027390, BRAMY20076130, BRAMY2011841 0, 
B R AM Y20 125360, BRAMY20237190, BRCAN20001480, BRHIP1 0000720, CD34C20001 750, CTONG200 78340, 

5 CTONG20085210, DFNES20063460, FCBBF20029280, FCBBF20033360, FCBBF30078600, FEBRA20007820, 
FEBRA20031280, FEBRA20031810, FEBRA20040290, HLUNG20041540, HLUNG20092530, MESAN20021860, 
MESAN20067430, MESAN20084150, NT2NE20092950, NT2RP 7003 1070, OCBBF20012520, OCBBF201 42290, 
OCBBF201 65900, OCBBF201 70350, OCBBF201 76650, PLACE60006300, PROST20011160, PROST201 06060, 
SPLEN200407B0, SPLEN201 1 0860, SPLEN201 77400, TESTI2003061 0, TESTI200431 30, TESTI20059370, 

10 TESTI20254480, THYMU1 0004280, THYMU20030460, THYMU20062520, THYMU 20 078240, THYMU20150190, 
TRACH20090060,TRACH20125620, UTERU20026620, UTERU20045200, UTERU200641 20, UTERU201 03200. 
[01 57] On the other hand, the clones whose expression levels were decreased owing to the presence of TNF-a were 
BNGH420052350, BRACE20052530, BRAMY20003880, CTONG201 70940, FCBBF30022680, FCBBF30225930, 
FCBBF30257370, FEBRA20046280, KIDNE20084030, KIDNE201 88630, NT2NE20082130, OCBBF201 10210, 

15 PLACE60061370, PROST20041460, PROST20075280, PROST20110120, SMINT20006020, TESTI20046540, 
TESTI20057200, TESTI20113940, TESTI2025791 0, TESTI20262940, TRACH201 49740. 
[0158] These clones are inflammation- related genes. 

[0159] MKN45, which is a gastric cancer cell line, was co-cultured with Helicobacter pylori. The genes whose ex- 
pression levels were altered owing to the presence of Helicobacter pylori were searched for, and the result showed 

20 that the clones whose expression levels were increased owing to the presence of Helicobacter pylori were 

BRAMY20028530, BRAMY20035380, OCBBF201 70350, PROST20011160, SKMUS20091 900, SPLEN20040780, 
THYMU20078240, TRACH201 90460, UTERU20045200, UTERU20064120, ASTRO2005553 CTONG201 70940, 
FEBRA20040290, MESAN20067430, PROST20016760, THYMU1 0004280, TRACH20090060, UTERU20041970, 
OCBBF201 42290, TESTI20030610. 

25 [01 60] On the other hand, the clones whose expression levels were decreased owing to the presence of Helicobacter 
pylori were 

ASTRO20088950, BRACE20052530, B RAM Y200 03880, BRAMY20027390, BRAMY20036530, BRAMY2011 841 0, 
BRHIP20000210, FCBBF20032930, FCBBF30022680, FCBBF301 69870, FEBRA201 82030, KIDNE201 82540, 
LIVER20007750, MESAN20021 220, NT2NE20059210, NT2NE20082130, OCBBF201 55030, PROST20065100, 
30 PROST20075280, SPLEN2011 0860, TESTI20057200, TESTI2011 3940, TESTI201 49880, TESTI20151800, 
TESTI201 98600, TESTI20257910, THYMU20046770, THYMU20058550, THYMU20150190, FCBBF20033360, 
FCBBF30257370, FEBRA20098040, SMINT20006020. 
[0161] These clones are involved in gastritis or gastroduodenal ulcer. 

[0162] For example, if the polypeptide encoded by the cDNA of the present invention is a regulatory factor of cellular 
35 conditions such as growth and differentiation, it can be used for developing medicines as follows. The polypeptide or 
antibody provided by the invention is injected into a certain kind of cells by microinjection. Then, using the cells, it is 
possible -to screen low molecular weight compounds, etc. by measuring the change in the cellular conditions, or the 
activation or inhibition of a particular gene. The screening can be performed as follows. 

[0163] First, the polypeptide is expressed and purified as recombinant. The purified polypeptide is microinjected into 
40 cells such as various cell lines, or primary culture cells, and the cellular change such as growth and differentiation can 
be examined. Alternatively, the induction of genes whose expression is known to be involved in a particular change of 
cellular conditions may be detected by the amount of mRNA or polypeptide. Alternatively, the amount of intracellular 
molecules (low molecular weight compounds, etc.) that is changed by the function of the gene product (polypeptide) 
which is known to be involved in a particular change of cellular conditions may be detected. The compounds to be 
45 screened (both low and high molecular compounds are acceptable) can be added to the culture media and assessed 
for their activity by measuring the change of the cellular conditions. 

[0164] Instead of microinjection, cell lines introduced with the gene obtained in the invention can be used for the 
screening. If the gene product is turn out to be involved in a particular change in the cellular conditions, the change of 
the product can be used as a measurement for screening. Once a compound is screened out which can activate or 
50 inhibit the function of the polypeptide of the invention, it can be applied for developing medicines. 

[0165] If the polypeptide encoded by the cDNA of the present invention is a secretory protein, membrane protein, or 
protein involved in signal transduction, glycoprotein, transcription, or diseases, it can be used in functional assays for 
developing medicines. 

[0166] In case of a membrane protein, it is most likely to be a polypeptide that functions as a receptor or ligand on 
55 the cell surface. Therefore, it is possible to reveal a new relationship between a ligand and receptor by screening the 
membrane protein of the invention based on the binding activity with the known ligand or receptor. Screening can be 
performed according to the known methods. 

[0167] For example, a ligand against the polypeptide of the invention can be screened in the following manner. 
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Namely, a ligand that binds to a specific polypeptide can be screened by a method comprising the steps of: (a) contacting 
a test sample with the polypeptide of the invention or a partial peptide thereof, or cells expressing these, and (b) 
selecting a test sample that binds to said polypeptide, said partial peptide, or said cells. 

[0168] On the other hand, for example, screening using cells expressing the polypeptide of the present invention 
5 that is a receptor protein can also be performed as follows. It is possible to screen receptors that is capable of binding 
to a specific polypeptide by using procedures (a) attaching the sample cells to the polypeptide of the invention or its 
partial peptide, and (b) selecting cells that can bind to the said polypeptide or its partial peptide. 
[0169] In a following screening as an example, first the polypeptide of the invention is expressed, and the recombinant 
polypeptide is purified. Next, the purified polypeptide is labeled, binding assay is performed using a various cell lines 
10 or primary cultured cells, and cells that are expressing a receptor are selected (Growth and differentiation factors and 
their receptors, Shin-Seikagaku Jikken Kouza Vol.7 (1991) Honjyo, Arai, Taniguchi, and Muramatsu edit, p203-236, 
Tokyo-Kagaku-Doujin). A polypeptide of the invention can be labeled with Rl such as 125 l, and enzyme (alkaline phos- 
phatase etc.). 

[01 70] Alternatively, a polypeptide of the invention may be used without labeling and then detected by using a labeled 
15 antibody against the polypeptide. The cells that are selected by the above screening methods, which express a receptor 
of the polypeptide of the invention, can be used for the further screening of an agonists or antagonists of the said 
receptor. 

[0171] Once the ligand binding to the polypeptide of the invention, the receptor of the polypeptide of the invention 
or the cells expressing the receptor are obtained by screening, it is possible to screen a compound that binds to the 
20 ligand and receptor. Also it is possible to screen a compound that can inhibit both bindings (agonists or antagonists of 
the receptor, for example) by utilizing the binding activities. 

[01 72] When the polypeptide of the invention is a receptor, the screening method comprises the steps of (a) contacting 
the polypeptide of the invention or cells expressing the polypeptide of the invention with the ligand, in the presence of 
a test sample, (b) detecting the binding activity between said polypeptide or cells expressing said polypeptide and the 

25 ligand, and (c) selecting a compound that reduces said binding activity when compared to the activity in the absence 
of the test sample. Furthermore, when the polypeptide of the invention is a ligand, the screening method comprises 
the steps of (a) contacting the polypeptide of the invention with its receptor or cells expressing the receptor in the 
presence of samples, (b) detecting the binding activity between the polypeptide and its receptor or the cells expressing 
the receptor, and (c) selecting a compound that can potentially reduce the binding activity compared to the activity in 

30 the absence of the sample. 

[0173] Samples to screen include cell extracts, expressed products from a gene library, synthesized low molecular 
compound, synthesized peptide, and natural compounds, for example, but are not construed to be listed here. A com- 
pound that is isolated by the above screening using a binding activity of the polypeptide of the invention can also be 
used as a sample. 

35 [0174] A compound isolated by the screening may be a candidate to be an agonist or an antagonist of the receptor 
of the polypeptide. By utilizing an assay that monitors a change in the intracellular signaling such as phosphorylation 
which results from reduction of the binding between the polypeptide and its receptor, it is possible to identify whether 
the obtained compound is an agonist or antagonist of the receptor. Also, the compound may be a candidate of a 
molecule that can inhibit the interaction between the polypeptide and its associated proteins (including a receptor) in 

40 vivo. Such compounds can be used for developing drugs for precaution or cures of a disease in which the polypeptide 
is involved. 

[0175] Secretory proteins may regulate cellular conditions such as growth and differentiation. It is possible to find 
out a novel factor that regulates cellular conditions by adding the secretory protein of the invention to a certain kind of 
cell, and performing a screening by utilizing the cellular changes in growth or differentiation, or activation of a particular 
^5 gene. 

[0176] The screening can be performed, for example, as follows. First, the polypeptide of the invention is expressed 
and purified in a recombinant form. Then, the purified polypeptide is added to a various kind of cell lines or primary 
cultured cells, and the change in the cell growth and differentiation is monitored. The induction of a particular gene 
that is known to be involved in a certain cellular change is detected by the amounts of mRNA and polypeptide. Alter- 
so natively, the amount of an intracellular molecule (low-molecular-weight compounds, etc.) that is changed by the function 
of a gene product (polypeptide) that is known to function in a certain cellular change is used for the detection. 
[0177] Once the screening reveals that then polypeptide of the invention can regulate cellular conditions or the func- 
tions, it is possible to apply the polypeptide as a pharmaceutical and diagnostic medicine for related diseases by itself 
or by altering a part of it into an appropriate composition. 
55 [0178] As is above described for membrane proteins, the secretory protein provided by the invention may be used 
to explore a novel ligand-receptor interaction using a screening based on the binding activity to a known ligand or 
receptor. A similar method can be used to identify an agonist or antagonist. The resulting compounds obtained by the 
methods can be a candidate of a compound that can inhibit the interaction between the polypeptide of the invention 
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and an interacting molecule (including a receptor). The compounds may be able to use as a preventive, therapeutic, 
and diagnostic medicine for the diseases, in which the polypeptide may play a certain role. 

[0179] Proteins involved in signal transduction or transcription may be a factor that affects a certain polypeptide or 
gene in response to intracellular/extracellular stimuli. It is possible to find out a novel factor.that can affect a polypeptide 
5 or gene by expressing the polypeptide provided by the invention in a certain types of cells, and performing a screening 
utilizing the activation of a certain intracellular polypeptide or gene. 

[0180] The screening may be performed as follows. First, a transformed cell line expressing the polypeptide is ob- 
tained. Then, the transformed cell line and the untransformed original cell line are compared for the changes in the 
expression of a certain gene by detecting the amount of its mRNA or polypeptide. Alternatively, the amount of an 

10 intracellular molecule (low molecular weight compounds, etc.) that is changed by the function of a certain gene product 
(polypeptide) may be used for the detection. Furthermore, the change of the expression of a certain gene can be 
detected by introducing a fusion gene that comprises a regulatory region of the gene and a marker gene (luciferase, 
p-galactosidase, etc.) into a cell, expressing the polypeptide provided by the invention into the cell, and estimating the 
activity of a marker gene product (polypeptide). 

15 [0181] If the polypeptide or gene of the invention is involved in diseases, it is possible to screen a gene or compound 
that can regulate its expression and/or activity either directly or indirectly by utilizing the polypeptide of the present 
invention. 

[0182] For example, the polypeptide of the invention is expressed and purified as a recombinant polypeptide. Then, 
the polypeptide or gene that interacts with the polypeptide of the invention is purified, and screened based on the 

20 binding. Alternatively, the screening can be performed by adding with a compound of a candidate of the inhibitor added 
in advance and monitoring the change of binding activity. In another method, a transcription regulatory region locating 
in the 5'-upstream of the gene encoding the polypeptide of the invention that is capable of regulating the expression 
of other genes is obtained, and fused with a marker gene. The fusion is introduced into a cell, and the cell is added 
with compounds to explore a regulatory factor of the expression of the said gene. 

25 [0183] The compound obtained by the screening can be used for developing pharmaceutical and diagnostic medi- 
cines for the diseases in which the polypeptide of the present invention is involved. Similarly, if the regulatory factor 
obtained in the screening is turn out to be a polypeptide, compounds that can newly affect the expression or activity 
of the polypeptide may be used as a medicine for the diseases in which the polypeptide of the invention is involved. 
[0184] If the polypeptide of the invention has an enzymatic activity, regardless as to whether it is a secretory protein, 

30 membrane protein, or proteins involved in signal transduction, glycoprotein, transcription, or diseases, a screening 
may be performed by adding a compound to the polypeptide of the invention and monitoring the change of the com- 
pound. The enzymatic activity may also be utilized to screen a compound that can inhibit the activity of the polypeptide. 
[0185] In a screening given as an example, the polypeptide of the invention is expressed and the recombinant 
polypeptide is purified. Then, compounds are contacted with the purified polypeptide, and the amount of the compound 

35 and the reaction products is examined. Alternatively, compounds that are candidates of an inhibitor are pretreated, 
then a compound (substrate) that can react with the purified polypeptide is added, and the amount of the substrate 
and the reaction products is examined. 

[0186] The compounds obtained in the screening may be used as a medicine for diseases in which the polypeptide 
of the invention is involved. Also they can be applied for tests that examine whether the polypeptide of the invention 

40 functions normally in vivo. 

[0187] Whether the secretory protein, membrane protein, signal transduction-related protein, glycoprotein-related 
protein, or transcription-related protein of the present invention is a novel protein involved in diseases or not is deter- 
mined in another method than described above, by obtaining a specific antibody against the polypeptide of the invention, 
and examining the relationship between the expression or activity of the polypeptide and a certain disease. In an 

45 alternative way, it may be analyzed referred to the methods in "Molecular Diagnosis of Genetic Diseases" (Elles R. 
edit, (1996) in the series of "Method in Molecular Biology" (Humana Press). 

[0188] Proteins involved in diseases are targets of screening as mentioned, and thus are very useful in developing 
drugs which regulate their expression and activity. Also, the proteins are useful in the medicinal industry as a diagnostic 
marker of the related disease or a target of gene therapy. 

so [0189] Compounds isolated as mentioned above can be administered patients as it is, or after formulated into a 
pharmaceutical composition according to the known methods. For example, a pharmaceutical^ acceptable carrier or 
vehicle, specifically sterilized water, saline, plant oil, emulsifier, or suspending agent can be mixed with the compounds 
appropriately. The pharmaceutical compositions can be administered to patients by a method known to those skilled 
in the art, such as intraarterial, intravenous, or subcutaneous injections. The dosage may vary depending on the weight 

55 or age of a patient, or the method of administration, but those skilled in the art can choose an appropriate dosage 
properly. If the compound is encoded by polynucleotide, the polynucleotide can be cloned into a vector for gene therapy, 
and used for gene therapy. The dosage of the polynucleotide and the method of its administration may vary depending 
on the weight or age of a patient, or the symptoms, but those skilled in the art can choose properly. 
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[0190] The present invention further relates to databases comprising at least a sequence of polynucleotide and/or 
polypeptide, or a medium recorded in such databases : selected from the sequence data of the nucleotide and/or the 
amino acids indicated in Table 1 . The term "database" means a set of accumulated information as machine-searchable 
and readable information of nucleotide sequence. The databases of the present invention comprise at least one of the 

5 novel nucleotide sequences of polynucleotides provided by the present invention. The databases of the present inven- 
tion can consist of only the sequence data of the novel polynucleotides provided by the present invention or can com- 
prise other information on nucleotide sequences of known full-length cDNAs or ESTs. The databases of the present 
invention can be comprised of not only the information on the nucleotide sequences but also the information on the 
gene functions revealed by the present invention. Additional information such as names of DNA clones carrying the 

10 full-length cDNAs can be recorded or linked together with the sequence data in the databases. 

[0191 J The database of the present invention is useful for gaining complete gene sequence information from partial 
sequence information of a gene of interest. The database of the present invention comprises nucleotide sequence 
information of full-length cDNAs. Consequently, by comparing the information in this database with the nucleotide 
sequence of a partial gene fragment yielded by differential display method or subtraction method, the information on 

15 the full-length nucleotide sequence of interest can be gained from the sequence of the partial fragment as a starting clue. 
[01 92] The sequence information of the full-length cDNAs constituting the database of the present invention contains 
not only the information on the complete sequences but also extra information on expression frequency of the genes 
as well as homology of the genes to known genes and known polypeptides. Thus the extra information facilitates rapid 
functional analyses of partial gene fragments. Further, the information on human genes is accumulated in the database 

20 of the present invention, and therefore, the database is useful for isolating a human homologue of a gene originating 
from other species. The human homologue can be isolated based on the nucleotide sequence of the gene from the 
original species. 

[0193] At present, information on a wide variety of gene fragments can be obtained by differential display method 
and subtraction method. In general, these gene fragments are utilized as tools for isolating the full-length sequences 

25 thereof. When the gene fragment corresponds to an already-known gene, the full-length sequence is easily obtained 
by comparing the partial sequence with the information in known databases. However, when there exists no information 
corresponding to the partial sequence of interest in the known databases, cDNA cloning should be carried out for the 
full-length cDNA. It is often difficult to obtain the full-length nucleotide sequence using the partial sequence information 
as an initial clue. If the full-length of the gene is not available, the amino acid sequence of the polypeptide encoded by 

30 the gene remains unidentified. Thus the database of the present invention can contribute to the identification of full- 
length cDNAs corresponding to gene fragments, which cannot be revealed by using databases of known genes. 
[01 94] The present invention has provided 1 970 polynucleotides. As has not yet proceeded the isolation of full-length 
cDNA within the human, the invention has great significance. It is known that secretory proteins, membrane proteins, 
signal transduction -related proteins, glycoprotein-related proteins, transcription-related proteins, and so on are involved 

35 in many diseases. The genes and proteins involved in diseases are useful for developing a diagnostic marker or med- 
icines for regulation of their expression and activity, or as a target of gene therapy. 

[0195] In particular, cDNA assumed to encode secretory proteins, which were provided by this invention, are very 
important for the industry since the encoded proteins themselves are expected to be useful as pharmaceutical agents 
and many disease-related genes may be included in them. In addition, membrane proteins, signal transduction-related 
40 proteins, transcription-related proteins, disease-related proteins, and genes encoding them can be used as indicators 
for diseases, etc. These cDNA are also very important for the industry, which are expected to regulate the activity or 
expression of the encoded protein to treat diseases, etc. 

[0196] Any patents, patent applications, and publications cited herein are incorporated by reference. 
[0197] The invention is illustrated more specifically with reference to the following examples, but is not to be construed 
45 as being limited thereto. 

EXAMPLE 1 

Preparation of cDNA library by oligo-capping 

so ' ~ ~ ~" " 

(1) Extraction and purchase of mRNA 

[01 98] Total RNAs as mRNA sources were extracted from human tissues (shown below) by the method as described 
in the reference (J. Sambrook, E. F. Fritsch &T Maniatis, Molecular Cloning Second edition, Cold Spring harbor Lab- 
55 oratory Press, 1989). Further, by the method as described in the reference (J. Sambrook, E. F. Fritsch & T. Maniatis, 
Molecular Cloning Second edition, Cold Spring harbor Laboratory Press, 1989), total RNAs as mRNA sources were 
extracted from human culture cells and human primary culture cells (shown below) which had been cultivated by the 
methods described in the catalogs. 
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[0199] The library names and the origins are indicated below in the order of "Library name: Origin". When a library 
was prepared by the subtraction method, the item is followed by a description of how to prepare the subtracted library. 

<Extraction of mRNA from human tissues> 

5 

[0200] 

NTONG: Normal tongue; 
CTONG: Tongue cancer; 
10 FCBBF: Fetal brain; 

OCBBF: Fetal brain; 
PLACE: Placenta; 

SYNOV: Synovial membrane tissue (from rheumatioid arthritis). 
15 < Extraction of mRNA from culture cells> 
[0201] 

BNGH4: H4 cells (ATCC #HTB-148); 
20 IMR32: IMR32 cells (ATCC #CCL-127); 

SKNMC: SK-N-MC cells (ATCC #HTB-10); 

3NB69: NB69 cells (RCB #RCB0480); 

BGGI1 : GI1 cells (RCB #RCB0763); 

NB9N4: NB9 cells (RCB #RCB0477); 
25 SKNSH: SK-N-SH cells (RCB #RCB0426); 

NT2RM: NT2 cells (STARATAGENE #204101); 

NT2RP: NT2 cells treated with retinoic acid (RA) for 5 weeks to induce the differentiation; 

NT2RI: NT2 cells treated with RA for 5 weeks to induce the differentiation, followed by the treatment with the growth 
inhibitor for 2 weeks; 

30 NT2NE: NT2 cells were treated with RA and the growth inhibitor for the neuronal differentiation, and the resultant 

neurons were concentrated and harvested (NT2 Neuron); 

NTISM: NT2 cells (STARATAGENE #204101) were treated with RA for 5 weeks to induce the differentiation, and 
then treated with the growth inhibitor for 2 weeks; mRNA was prepared from the ceils and a cDNA library was 
constructed from the mRNA; the cDNAs of the library whose nucleotide sequences were shared by those of mRNAs 
35 from undifferentiated NT2 cells were subtracted by using a Subtract Kit (Invitrogen #K4320-01) ; the subtracted 

library (NT2RI-NT2RM) was provided by this procedure. 

[0202] RCB indicates that the cell was provided by the Cell Bank, RIKEN GENE BANK, The Institute of Physical and 
Chemical Research; ATCC indicates that the cell was provided by American Type Culture Collection. 

40 

< Extraction of mRNA from primary culture cells> 
[0203] 

45 ASTRO: Normal human astrocyte NHA5732, Takara Shuzo #CC2565; 

DFNES: Normal human dermal fibroblast (neonatal skin); NHDF-Neo NHDF2564, Takara Shuzo #CC2509; 

MESAN: Normal human mesangial cell NHMC56046-2, Takara Shuzo #CC2559; 

NHNPC: Normal human neural progenitor cell NHNP5958, Takara Shuzo #CC2599; 

PEBLM: Normal human peripheral blood mononuclear cell HPBMC5939, Takara Shuzo #CC2702; 
50 HSYRA: Human synoviocyte HS-RA (from rheumatioid arthritis), Toyobo #T404K-05; 

PUAEN: Normal human pulmonary artery endothelial cells, Toyobo #T302K-05; 

UMVEN: Normal human umbilical vein endothelial cell HUVEC, Toyobo #T200K-05; 

HCASM: Normal human coronary artery smooth muscle cell HCASMC, Toyobo #T305K-05; 

HCHON: Normal human chondrocyte HC, Toyobo #T402K-05; 
55 HHDPC: Normal human dermal papilla cell HDPC, Toyobo #THPCK-001; CD34C: CD34+ cells (AllCelts, LLC 

#CB14435M) ; 

D30ST: CD34+ cells treated with the osteoclast differentiation factor (ODF) for 3 days to induce the differentiation; 
D60ST: CD34+ cells treated with ODF for 6 days to induce the differentiation; 
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D90ST: CD34+ cells treated with ODF for 9 days to induce the differentiation. 

[0204] Then, total RNAs extracted from the following human tissues were purchased and used as mRNA sources. 
The library names and the origins are indicated below in the order of "Library name: Origin". When a library was 
5 prepared by the subtraction method, the item is followed by a description of how to prepare the subtracted library. 

<Purchase of total RNA containing mRNA extracted from human tissues> 

[0205] 

10 

ADRGL: Adrenal gland, CLONTECH #64016-1; 

BRACE: Brain (cerebellum), CLONTECH #64035-1; 

BRAWH: Whole brain, CLONTECH #64020-1 ; 

FEBRA: Fetal brain, CLONTECH #64019-1; 
15 FELIV: Fetal liver, CLONTECH #6401 8-1 ; 

HEART: Heart, CLONTECH #64025-1; 

HLUNG: Lung, CLONTECH #64023-1; 

KIDNE: Kidney, CLONTECH #64030-1; 

LIVER: Liver, CLONTECH #64022-1 ; 
20 MAMGL: Mammary Gland, CLONTECH #64037-1 ; 

PANCR: Pancreas, CLONTECH #64031-1; 

PROST: Prostate, CLONTECH #64038-1; 

SALGL: Salivary Gland, CLONTECH #64026-1; 

SKMUS: Skeletal Muscle, CLONTECH #64033-1; 
25 SMINT: Small Intestine, CLONTECH #64039-1 ; 

SPLEN: Spleen, CLONTECH #64034-1; 

STOMA: Stomach, CLONTECH #64090-1; 

TBAES: Breast (Tumor), CLONTECH #64015-1; 

TCERX: Cervix (Tumor), CLONTECH #64010-1; 
30 TCOLN: Colon (Tumor), CLONTECH #64014-1 ; 

TESTI: Testis, CLONTECH #64027-1 ; 

THYMU: Thymus, CLONTECH #64028-1; 

TLUNG: Lung (Tumor), CLONTECH #64013-1; 

TOVAR: Ovary (Tumor), CLONTECH #64011-1; 
35 TRACH: Trachea, CLONTECH #64091-1; 

TUTER: Uterus (Tumor), CLONTECH #64008-1; 

UTERU: Uterus, CLONTECH #64029-1; 

ADIPS: Adipose, Invitrogen #D6005-01; 

BLADE: Bladder, Invitrogen #D6020-01 ; 
40 BRALZ: Cerebral cortex from an Alzheimer patient (Brain, cortex, Alzheimer), Invitrogen #D6830-01; 

CERVX: Cervix, Invitrogen #D6047-01; 

COLON: Colon, Invitrogen #D6050-0; 

NESOP: Esophagus, Invitrogen #D6060-01; 

PERIC: Pericardium, Invitrogen #D61 05-01; 
45 RECTM: Rectum, Invitrogen #D61 10-01 ; 

TESOP: Esophageal (Tumor), Invitrogen #D6860-01 ; 

TKIDN: Kidney (Tumor), Invitrogen #D6870-01; 

TLIVE: Liver (Tumor), Invitrogen #D6880-01; 

TSTOM: Stomach (Tumor), Invitrogen #D6920-01; 
50 BEAST: Adult breast, STARATAGEN E #735044; 

FEHRT: Fetal heart, STARATAGENE #738012; 

FEKID: Fetal kidney, STARATAGENE #738014; 

FELNG: Fetal lung, STARATAGENE #738020; 

NOVAR: Adult ovary, STARATAGENE #735260; 
55 BRASW: subtracted library (BRALZ-BRAWH). A cDNA library was constructed from mRNA prepared from tissues 

of cerebral cortex obtained from an Alzheimer patient [BRALZ: Cerebral cortex from an Alzheimer patient (Brain, 

cortex, Alzheimer), Invitrogen #D6830-01]; the cDNAs of this library whose nucleotide sequences were shared by 

those of mRNAs from whole brain tissue [BRAWN: Whole brain, CLONTECH #64020-1] were subtracted by using 



76 



EP 1 308 459 A2 



a Subtract Kit (Invitrogen #K4320-01). 

[0206] Further, mRNAs extracted and purified as poly A(+) RNAs from the human tissues shown below were pur- 
chased. A cDNA library was prepared from an RNA mixture in which the poly A(+) RNA from each tissue had been 
5 combined with poly A(-) RNA. The poly A(-) RNA was prepared by removing poly A(+) RNA from the total RNA of whole 
brain tissue (CLONTECH #64020-1 ) by using oiigo dT cellulose. The library names and the origins are indicated below 
in the order of "Library name: Origin". 

<Purchase of mRNAs of human tissues as poly A(+) RNAs> 

10 

[0207] 

BRAMY: Brain (amygdala), CLONTECH #6574-1 ; 
BRCAN: Brain (caudate nucleus), CLONTECH #6575-1; 
15 BRCOC: Brain (corpus callosum), CLONTECH #6577-1 ; 

BRHIP: Brain (hippocampus), CLONTECH #6578-1; 
BRSSN: Brain (substantia nigra), CLONTECH #6580-1; 
BRSTN: Brain (subthalamic nucleus), CLONTECH #6581-1; 
BRTHA: Brain (thalamus), CLONTECH #6582-1. 

20 

(2) Preparation of cDNA library 

[0208] cDNA library was prepared from each RNA by the improved method (WO 01/04286) of oligo capping [M. 
Maruyama and S. Sugano, Gene, 1 38: 1 71 -1 74 (1 994)]. A series of procedures, BAP (Bacterial Alkaline Phosphatase) 

25 treatment, TAP (Tobacco Acid Pyrophosphatase) treatment, RNA ligation, first strand cDNA synthesis and RNA re- 
moval, were carried out using the oligo-cap linker (SEQ ID NO: 4093) and oligo dT primer (SEQ ID NO: 4094), as 
described in WO 01/04286. Then, the single-stranded cDNA was converted to a double-stranded cDNA by PCR 
(polymerase chain reaction) using 5' (SEQ ID NO: 4095) and 3* (SEQ ID NO: 4096) PCR primers, and then digested 
with Sfil. Then, a fraction of cDN A fragments, typically 2 -kb or longer (3-kb or longer in some cases), was unidirectionally 

30 cloned into a Dralll-digested pME1 8SFL3 vector (Figure 1 ) (GenBank AB009864, Expression vector); the cDNA library 
was thus prepared. 

[0209] The names of cDNA libraries, which were used in the analysis of full-length cDN A sequences, and their origins 
are shown in Table 2. 

35 Table 2 



Library 


Type 


Origin etc. 


ADRGL 


Tissue 


Adrenal gland (CLONTECH #64016-1) 


ASTRO 


Primary culture cell 


Normal Human Astrocyte NHA5732 (Takara Shuzo #CC2565) 


BGGI1 


Culture cell 


GI1 cells (RCB #RCB0763) 


BNGH4 


Culture cell 


H4 cells (ATCC #HTB-1 48) ! 


BRACE 


Tissue 


Brain, cerebellum (CLONTECH #64035-1) 


BARMY 


Tissue 


Brain, amygdala (CLONTECH #6574-1) 


BRAWH 


Tissue 


Brain, whole (CLONTECH #64020-1) 


BRCAN 


Tissue 


Brain, caudate nucleus (CLONTECH #6575-1) 


BRCOC 


Tissue 


Brain, corpus callosum (CLONTECH #6577-1) 


BRHIP 


Tissue 


Brain, hippocampus (CLONTECH #6578-1) 


BRSSN 


Tissue 


Brain, substantia nigra (CLONTECH #6580-1) 


CD34C 


Primary culture cell 


CD34+ cells (AllCells, LLC #CB14435M) 


CTONG 


Tissue 


Tongue, Cancer 


DFNES 


Primary culture cell 


Normal Human Dermal Fibroblasts (Neonatal Skin); NHDF-Neo 
NHDF2564 (Takara Shuzo #CC2509) 
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Table 2 (continued) 





Library 


Type 


Origin etc. 




FCBBF 


Tissue 


Brain, Fetal 


5 


FEBRA 


Tissue 


Brain, Fetal (CLONTECH #64019-1) j 




HCHON 


Primary culture cell Tissue 


Human Chondrocytes HC (Toyobo #T402K-05) 




HEART 




Heart (CLONTECH #64025-1) 


10 


HHDPC 


Primary culture cell 


Human dermal papilla cells HDPC (Toyobo #THPCK-001) 




HLUNG 


Tissue 


Lung (CLONTECH #64023-1 ) : 




KIDNE 


Tissue 


Kidney (CLONTECH #64030-1) 


15 


LIVER 


Tissue 


Liver (CLONTECH #64022-1) 


MESAN 


Primary culture cell 


Normal human mesangial cells NHMC56046-2 (Takara Shuzo #CC2559) 




NESOP 


Tissue 


Esophagus (Invitrogen #D6060-01) 




NT2NE 


Culture cell 


NT2 cells concentrated after differentiation (NT2 Neuron) 


20 


NT2RI 


Culture cell 


NT2 cells treated by growth inhibitor for 2 weeks after RA induction for 5 
weeks 




NT2RP 


Culture cell 


NT2 cells treated by RA for 5 weeks 




NTONG 


Tissue 


Tongue 


25 


OCBBF 


Tissue 


Brain, Fetal 




PANCR 


Tissue 


Pancreas (CLONTECH #64031-1) 


30 


PEBLM 


Primary culture cell 


Human peripheral blood mononuclear cells HPBMC5939 (Takara Shuzo 
#CC2702) 


PLACE 


Tissue 


Placenta 

1 luwwlllu 




PROST 


Tissue 


Prostate (CLONTECH #&4038-1^ 




PUAEN 


Primary culture cell 


Human nulmnnarv artprv pnrinthelial r^llQ fTnvnhn &~T'\CiOVC-C\t : \\ 


35 


SALGL 


Tissue 


Salivarv Gland fCLONTFCH #640?6-1^ 




SKMUS 


Tissue 


Skeletal Muscle (CLONTECH #640^-1* 




SKNMC 


Culture cell 


SK-N-MC cells fATCC #HTR-10^ 


40 


SKNSH 


Culture cell 


SK-N-SH cells (RCB #RCB0426) 


SMINT 


Tissue 


Small Intestine (CLONTECH #64039-1) 




SPLEN 


Tissue 


Spleen (CLONTECH #64034-1) 




TESOP 


Tissue 


Esophageal, Tumor (Invitrogen #D6860-01) 


45 


TESTI 


Tissue 


Testis (CLONTECH #64027-1) 




THYMU 


Tissue 


Thymus (CLONTECH #64028-1) 




TKIDN 


Tissue 


Kidney, Tumor (Invitrogen #D6870-01) 


50 


TRACH 


Tissue 


Trachea (CLONTECH #64091-1) 


UMVEN 


primary culture cell 


Human umbilical vein endothelial cells HUVEC (Toyobo #T200K-05) 




UTERU 


Tissue 


Uterus (CLONTECH #64029-1) 



[0210] The cDNA library with the high fullness ratio (the fullness ratio of 5'-end, which was calculated for each cDNA 
library by using the protein coding region found in known mRNA species as an index, was 90% in average) prepared 
by the improved oligo-capping method was constructed by using a eukaryotic expression vector pME18SFL3. The 
vector contains SRot promoter and SV40 small t intron in the upstream of the cloning site, and SV40 polyA added signal 
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sequence site in the downstream. As the cloning site of pME18SFL3 has asymmetrical Dralll sites, and the ends of 
cDNA fragments contain Sfil sites complementary to the Dralll sites, the cloned cDNA fragments can be inserted into 
the downstream of the SRoc promoter unidirectionally. Therefore, clones containing full-length cDNA can be expressed 
transiently by introducing the obtained plasmid directly into COS cells, etc. Thus, the clones can be analyzed very 
s easily in terms of the proteins that are the gene products of the clones, or in terms of the biological activities of the 
proteins. 

(3) Assessment of the 5'-end completeness of clones derived from the cDNA library prepared by oligo-capping 

10 [021 1 ] With respect to the plasmid DNAs of clones derived from the libraries, the nucleotide sequences of cDNA 5'- 
ends (3'-ends as well in some cases) were determined in a DNA sequencer (ABl PRISM 3700, PE Biosystems), after 
sequencing reaction was conducted by using a DNA sequencing reagent (BigDye Terminator Cycle Sequencing FS 
Ready Reaction Kit, PE Biosystems) according to the manual. A database was constructed based on the obtained data. 
[0212] The 5' -end completeness of about 770,000 clones derived from the human cDNA libraries prepared by the 

15 improved oligo-capping method was determined by the following method. The clones whose 5*-end sequences were 
consistent with those of known human mRNA in the public database were judged to be "full-length" if they had a longer 
5'-end sequence than that of the known human mRNA; or even though the 5'-end sequence was shorter, if it contained 
the translation initiation codon it was judged to have the "full-length" sequence. Clones which did not contain the trans- 
lation initiation codon were judged to be "not-full-length". The fullness ratio ((the number of full-length clones)/(the 

20 number of full-length and not-full-length clones)) at the 5'-end of the cDNA clones was determined by comparing with 
known human mRNA. As a result, the fullness ratio of the 5'-ends was 90%. The result indicates that the fullness ratio 
at the 5 l -end sequence was extremely high in the human cDNA clones obtained by the oligo-capping method. 

EXAMPLE 2 

25 

Sequencing analysis of cDNA ends and selection of full-length clones 

[0213] With respect to the plasmid DNAs of clones obtained from each cDNA library, the 5' -end nucleotide sequences 
of the cDNAs were determined in a DNA sequencer (ABl PRISM 3700, PE Biosystems), after sequencing reaction 
30 was conducted by using a DNA sequencing reagent (Dye Terminator Cycle Sequencing FS Ready Reaction Kit, dRhod- 
amine Terminator Cycle Sequencing FS Ready Reaction Kit or BigDye Terminator Cycle Sequencing FS Ready Re- 
action Kit, PE Biosystems) according to the manual. A database was constructed using the data obtained. 
[0214] For the analyzed 5'-end sequences of cDNA clones, the data with the annotation of "complete cds" in the 
GenBank and UniGene were searched by BLAST homology search. When identical to certain human mRNA sequenc- 
es es, such cDNA clones were excluded. Then, clustering was carried out. When the identity was 90% or higher, and the 
length of consensus sequence was 50 base pairs or longer, the cDNA clones were assumed to belong to an identical 
cluster, and thus clustered. cDNA clones longer in the 5* direction were selected from the members belonging to a 
cluster; if required, the 3'-end sequences of the selected clones were determined by the same analysis method as 
used to determine the 5' -end sequences. The data of the end sequences obtained were analyzed, and then the clones 
40 forming a sequence contig at 5'- and 3'-ends were excluded. Further, as mentioned above, the data was analyzed 
again by BLAST homology search: when identical to certain human mRNA sequences (including sequences patented 
and applied for), the cDN A clones were excluded. Thus, the cDNAs clones to be analyzed for their nucleotide sequence 
were obtained. 

45 EXAMPLE 3 

Analysis of the full-length nucleotide sequences 

[0215] The full-length nucleotide sequences of the selected clones were determined. The nucleotide sequence de- 
so termination was mainly performed by primer walking method comprising the dideoxy terminator method using custom- 
made synthetic DNA primers. Namely, the nucleotide sequences of the DNAs were determined in a sequencer from 
PE Biosystems, after sequencing reaction was carried out with a DNA sequencing reagent from the same supplier 
using the custom-made synthetic DNA primers according to the manual. A part of the clones were analyzed with a 
DNA sequencer from Licor. 

55 [0216] Further, the nucleotide sequences of a part of the clones were determined by the shotgun method where the 
plasmids containing the cDNAs were digested at random were used, instead of the use of custom-made primers, by 
the same method in the DNA sequencer. The full-length nucleotide sequences were finally determined by completely 
assembling the partial nucleotide sequences obtained by the above method. 
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[021 7] Then, the regions translatable to proteins were deduced from the determined full-length nucleotide sequences, 
and thereby the amino acid sequences were determined. SEQ ID NOs corresponding to the respective sequences are 
shown in Table 1 . 

5 EXAMPLE 4 

Functional prediction by homology search 

[0218] For the determined nucleotide sequences, GenBank, SwissProt, UniGene, and nr were searched by BLAST 
10 The clones exhibiting higher homology, which were convenient to predict their functions based on the nucleotide se- 
quences and deduced amino acid sequences, were selected based on the BLAST search hit data whose P value or 
E value was 10~ 4 or lower and for which the length of consensus sequence x homology = 30 or higher in the amino 
acid database search. Further, from them, representative clones were selected, which are shown as Homology Search 
Result Data in the last part herein. Accordingly, the data shown herein are merely the representative data, and the 
15 molecule exhibiting homology to each clone is not limited thereto. Further, with respect to a part of clones, the BLAST 
search hit data that did not meet the criteria as described above are not shown herein. 

EXAMPLE 5 

20 Search for signal sequence, transmembrane domain and other functional domains in the deduced amino acid 
sequences 

[0219] With respect to the amino acid sequences deduced from the full-length nucleotide sequences, the prediction 
was made for the presence of signal sequence at the amino terminus, the presence of transmembrane domain, and 

25 the presence of functional protein domains (motifs). The signal sequence at the amino terminus was searched for by 
PSORT [K. Nakai & M. Kanehisa, Genomics, 14: 897-911 (1 992)]; the transmembrane domain, by SOSUI [T. Hirokawa 
et al., Bioinformatics, 14: 378-379 (1998)] (Mitsui Knowledge Industry); the function domain, by Pfam (http://www. 
sanger.ac.uk/Software/Pfam/index.shtml). The amino acid sequence in which the signal sequence at the amino termi- 
nus or transmembrane domain had been predicted to be present by PSORT or SOSUI were assumed to be a secretory 

30 or membrane protein. Further, when the amino acid sequence hit a certain functional domain by the Pfam functional 
domain search, the protein function can be predicted based on the hit data, for example, by referring to the function 
categories on the PROSITE (http://www.expasy.ch/cgi-bin/prosite-list.pl). In addition, the functional domain search can 
also be carried out on the PROSITE. 

[0220] The search results obtained with the respective programs are shown below. 
35 [0221] The clones whose deduced amino acid sequences were detected to have the signal sequences by PSORT 
are as follows. 

ADRGL20021910, ADRGL20036380, ADRGL20063770, ASTR020020240, BNGH420052350, BNGH420077980, 
BRACE20054080, BRACE20 194670, BRAMY20044920, BRAMY20047560, BRAMY201 37360; BRAMY20204270, 
BRAMY20237190, BRAMY20251 750, BRAWH20020470, BRAWH20093070, BRCAN1 0001 680, BRHIP1 0000720, 

40 BRSSN20091190, CD34C20001750, CTONG20059130, CTONG20069320, FCBBF30062490, FCBBF301 32660, 
FEBRA20039260, FEBRA20040230, FEBRA20040560, FEBRA20046280, FEBRA201 82030, HCHON1 0001 660, 
HCHON20015050, HEART1 0001 490, HHDPC20088160, HLUNG20032460, HLUNG20034970, HLUNG20050760, 
HLUNG20081390, HLUNG20088750, KIDNE20134130, KIDNE20143200, LIVER20007750, LIVER2001 0510, 
MESAN20021220, MESAN20027900, MESAN20095220, NT2NE20069580, NT2NE20082130, NT2NE201 67660, 

45 NT2RP70003110, OCBBF20000740, OCBBF20012520, OCBBF201 10730, OCBBF201 18720, OCBBF201 55030, 
OCBBF201 70350, OCBBF201 91950, PANCR1 0000860, PEBLM20001800, PLACE 60004260, PLACE60006300, 
PLACE60055590, PLACE6005691 0, PLACE60057860, PLACE601 04630, PLACE601 84870, PROST20050390, 
PROST20084680, PROST201 05450, PROST201 06060, PROST20110120, SKMUS20091 900, SKNMC20006350, 
SMINT20024140, SMINT20028840, SMINT20086250, SMINT2 0088440, SMINT20088690, SPLEN2001 7810, 

50 SPLEN20073880, SPLEN20080070, SPLEN20101950, SPLEN201 08000, SPLEN201 10860, SPLEN2011 8050, 
SPLEN201 38600, SPLEN201391 00, SPLEN201 93230, SPLEN201 93490, SPLEN20201 830, TESTI20043130, 
TESTI20047370, TESTI20057200, TESTI20059080, TESTI20061200, TESTI20063330, TESTI20063600, 
TESTI201 02390, TESTI201 1 61 20, TESTI201 51 800, TESTI201 66670, TESTI2021 0030, TESTI20245860, 
THYMU20020800, THYMU20046770, THYMU20050010, THYMU20054800, THYMU20055740, THYMU20083390, 

55 THYMU201 15380, TRACH20081 270, TRACH201 59390, UTERU20040150, UTERU200641 20, UTERU20086530, 
UTERU20127150 

[0222] The clones whose deduced amino acid sequences were detected to have the transmembrane domains by 
SOSUI are as follows. Numerals indicate the numbers of transmembrane domains detected in the deduced amino acid 
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sequences. Of the search result, the clone name and the number of transmembrane domains are demarcated by a 
double slash mark (//). 

ADRGL20020290//1 0, ADRGL20021 91 0//2, ADRGL20063770//2, ASTRO2001 001 0//4, ASTRO20045840//3, 
ASTRO20053430//1 , ASTRO20055530//1 , ASTRO20055570//2, ASTRO20055930//2, ASTRO20075150//1 s 

5 ASTRO20088950//1 , ASTRO20091180//4, BNGH420021 680//1 , BNGH420023870//1 , BNGH420052350//1 , 
BNGH420059680//1 , BNGH420075940//1 , BNGH420087430//3, BRACE1 000051 0//2, BRACE20052530//1 , 
BRACE20066360//3, BRACE2006871 0//1 , BRACE20069000//5, BRACE200691 1 0//1 , BRAMY1 0001 730//1 , 
BRAMY2000151 0//1 , BRAMY20003880//1 , BRAMY20024790//2, BRAMY20027390//2, BRAMY20028530//2, 
BRAMY20035380//2, BRAMY2004521 0//1 , BRAMY20050940//3, BRAMY2005391 0//2, BRAMY20055760//5 : 

10 BRAMY20072440//8, BRAMY20083820//2, BRAMY20089770//1 , BRAMY20096930//1 , BRAMY20118410//1 , 

BRAMY20123400//2, BRAMY20125550//1 , BRAMY20127310//1 , BRAMY20127760//1 , BRAMY20135720//3, 
BRAMY20139440//5, BRAMY2015251 0//1 , BRAMY20194680//2, BRAMY20204270//1 , BRAMY20225320//2, 
BRAMY20237190//1, BRAMY20251 750//1 , BRAMY20285650//2, BRAWH20020470//7, BRAWH20026010//1 , 
BRAWH20030000//2, BRAWH20039640//2, BRAWH20055330//1 , BRAWH20078620//2, BRAWH20093040//3, 

15 BRAWH20185270//5, BRAWH201 90550//1 , BRCAN10000760//10, BRCAN20001480//3, BRCAN20004180//1 , 
BRCOC20000470//2, BRCOC20003600//1 , BRHIP1 0001 040//5, BRHIP20000210//1 , BRSSN20001 970//4, 
BRSSN20074640//2, CD34C20001 750//1 , CTONG2001 7490//1 , CTONG20029030//4, CTONG20041 260//9, 
CTONG20044870//2, CTONG20045500//2, CTONG20049480//3, CTONG20051 450//1 , CTONG20056150//2, 
CTONG20059130//6, CTONG20060040//1 , CTONG20065680//1 , CTONG20071 680//1 3, CTONG20076810//1 , 

20 CTONG20078340//2, CTONG20079590//1 , CTONG20083980//1 , CTONG20084020//2, CTONG20167750//1 , 
CTONG201 68240//3 : CTONG201 79890//5, CTONG201 83830//1 , DFNES2001 8000//2, DFNES200281 70//1 , 
DFNES20029660//8, DFNES20072990//9, DFNES20080880//1 , FCBBF2001 8680//3, FCBBF20029280//2, 
FCBBF20032930//1 , FCBBF20036360//2, FCBBF20054390//2, FCBBF30022680//1 , FCBBF30042610//7, 
FCBBF30062490//2, FCBBF30075970//1 , FCBBF30078600//3, FCBBF3009541 0//1 , FCBBF301 05440//3, 

25 FCBBF301 1 8670//1 , FCBBF301 45670//1 , FCBBF301 6451 0//1 , FCBBF301 69870//1 s FCBBF301 72330//1 , 
FCBBF301 77290//2, FCBBF301 79740// 1 , FCBBF30195690//2, FCBBF30197840//1 , FCBBF30212210//1 , 
FCBBF30223110//1 , FCBBF30225930//9, FCBBF30230610//1 , FCBBF30260480//5, FCBBF30266510//3, 
FCBBF30287940//1 , FCBBF5000061 0//2, FCBBF50004950//3, FEBRA20007820//4, FEBRA2001 8670//2, 
FEBRA20031280//7, FEBRA20031810//2, FEBRA20040560//4, FEBRA20057010//1 , FEBRA20080860//1 , 

30 FEBRA20084750//1, FEBRA20115930//10, FEBRA20116650//4, FEBRA20121 950//5, FEBRA20141 980//1 , 

FEBRA201 77800//1 , FEBRA20182030//1 , HCHON20015050//1 , HEART20031680//1 , HHDPC10001140//1 , 
HHDPC20051850//1, HHDPC20082790//3, HHDPC200881 60//1 , HLUNG20015070//2, HLUNG20020850//2, 
HLUNG20029490//1 , HLUNG20032460//1 , HLUNG20033350//2, HLUNG200371 60//2, HLUNG20050760//1 , 
HLUNG20052300//1, HLUNG20060670//1 , HLUNG20065990//1 , HLUNG20074330//1 , HLUNG20081390//2, 

35 HLUNG20092530//1 , KIDNE20016360//5, KIDNE20083150//3, KIDNE20084030//12, KIDNE20084800//2, 
KIDNE20086490//1 , KIDNE20086660//2, KIDNE20094670//1 , KIDNE20142900//1 , KIDNE20148080//1 , 
KIDNE201 60960//2, KIDNE201 6371 0//1 , KIDNE201 691 80//2, KIDNE201 82540//3, KIDN E201 861 70//1 , 
KIDNE20188630//1, KIDNE20189960//1 , LI VER20 00775 0//8, LIVER20010990//4, LIVER20030650//6, 
LIVER20038000//4, MESAN20007110//1 , MESAN20008150//3, MESAN20021220//2, MESAN20060430//1 , 

40 MESAN20067430//3, MESAN200841 50//3, NT2NE2001 8740//1 , NT2NE2003921 0//1 , NT2N E20 053230// 1 , 
NT2NE2005921 0//1 , NT2NE20080770//1 , NT2NE200821 30//3, NT2NE20092950//1 , NT2NE20152620//1 , 
NT2NE20167660//1, NT2NE201 81 800//1 , NT2RI2001 6240//2, NT2RI20021200//3, NT2RI20033920//5, 
NT2RP70027790//2, NT2RP70031 070//1 0, NT2RP70031 480//6, NT2RP700871 40//1 , OCBBF20000740//2, 
OCBBF20012520//2, OCBBF201 09780//1 , OCBBF201 10210//2, OCBBF20110730//2, OCBBF20112280//3, 

45 OCBBF20118720//3, OCBBF20123200//2, OCBBF201 6591 0//9, OCBBF201 76650//1 , OCBBF201 85630//1 , 
OCBBF20191950//3, PANCRI0000860//1 , PLACE60004260//1 , PLACE60006300//2, PU^CE60053280//3 ) 
PLACE60056910//1 , PLACE60057860//3, PLACE60061370//1 , PLACE60070500//2, PLACE60104630//3, 
PLACE60107010//2, PLACE601 84870//1 , PROST20011 1 60//2, PROST20014150//2, PROST20035830//1 , 
PROST20050390//1 , PROST200651 00//7, PROST20073280//1 , PROST20082430//1 , PROST20084680//3, 

50 PROST20105450//6, PROST2011 0120//1 , PROST20114100//2, PROST2015251 0//4, PROST201 68600//6, 

PUAEN1 0000870//2, SKMUS20006790//1 , SKMUS20020770//1 , SKNMC20006350//1 , SKNSH20094350//2, 
SMINT20008110//3, SMINT20024140//1 , SMINT20026200//1 , SMINT20028840//1 , SMINT20045470//1 , 
SMINT20081 330//2, SMINT200921 60//1 , SPLEN2001 51 00//2, SPLEN2001 761 0//1 , SPLEN2001 781 0//2, 
SPLEN20024620//2, SPLEN20054500//1 , SPLEN200581 80//1 , SPLEN2008591 0//2, SPLEN20090880//2, 

55 S PLEN20 101 950//5 , SPLEN201 04690//1 , SPLEN201 051 00//2, SPLEN201 1 01 80//1 , SPLEN201 21 790//1 , 
SPLEN201 25230//1 , SPLEN201 36700//1 , S PL EN20 1 75920// 1 , SPLEN201 77400//2, SPLEN201 82850//1 , 
SPLEN201 83950//1 , SPLEN201 90080//1 , S PL EN20 1 90770// 1 , SPLEN201 93230//1 , SPLEN201 93790//1 , 
SPLEN20204670//1 , TESOP1 0000350//1 , TESTI200061 60//5, TESTI20038240//1 , TESTI20043220//1 , 
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TESTI20046540//1 , TESTI20046870//1 , TESTI20050400//1 , TESTI20051 730//7, TESTI20053260//2, 
TESTI20053780//1 , TESTI20057200//1 , TESTI20057590//1 , TESTI20061200//10, TESTI20062120//1 , 
TESTI20063330//1 , TESTI2006341 0//1 , TESTI20068530//2, TESTI20070400//1 0, TESTI20070740//1 , 
TESTI20073460//1 , TESTI20095200//6, TESTI20095880//1 0, TESTI201 00090//1 , TESTI201 02390//2, 

5 TESTI20113940//3, TESTI20121 040//1 , TESTI20149880//3, TESTI20151800//3, TESTI20162780//4, 
TESTI201 701 70//1 , TESTI201 73050//1 , TESTI201 8611 0//11 , TESTI201 98540//2, TESTI201 9911 0//1 , 
TESTI20202830//1 , TESTI20204260//1 , TESTI20214630//2, TESTI20244730//1 , TESTI20245860//1 , 
TESTI2024641 0//1 , TESTI2025791 0//1 , TESTI20260640//1 , TESTI20261 040//4, TESTI202621 50//11 , 
TESTI20262940//1 , TESTI2026491 0//1 , TESTI20271 790//9, TESTI20278280//3, TESTI20282420//2, 

10 TESTI20282900//1 , TESTI20286590//1 , THYMU20007020//3, THYMU20012020//1 , THYMU2001 7270//2, 
THYMU20020800//4, THYMU20025480//3, THYMU20030690//7, THYMU20034790//2, THYMU20046350//1 , 
THYMU20050010//5, THYMU20054800//2, THYMU20055740//1 , THYMU20062770//1 , THYMU20078240//1 , 
THYMU20079690//2, THYMU20083390//3, THYMU20087270//1 , THYMU201 00940//4, THYMU201 06990//1 , 
THYMU20115380//2, THYMU20137050//3, THYMU20137570//1 , THYMU20143230//2, THYMU20145990//3, 

15 THYMU20150190//2, THYMU20153210//1 0, THYMU20154790//3, THYMU201 63600//2, THYMU20171580//1 , 
THYMU20176010//1, THYMU201 78440//1 , THYMU20185470//2, TRACH20011540//2, TRACH20021380//4, 
TRACH20073990//1 , TRACH20081270//2, TRACH20149720//1 , TRACH201 497401/1 , TRACH20163470//11 , 
TRACH20165330//3, TRACH201 67090//2, TRACH201 73680//8, UMVEN1 0001380//2, UTERU20035770//1 , 
UTERU20045200//5, UTERU20064120//7, UTERU20086530//2, UTERU20089300//1 , UTERU20095100//3, 

20 UTERU20099040//5, UTERU201 03200//3, UTERU2012581 0//1 , UTERU201271 50//8, UTERU20139760//1 , 
UTERU20188840//2 

[0223] The Names of clones whose deduced amino acid sequences were detected to have functional domains with 
Pfam, and the name of hit functional domains are as follows. The search result is indicated as "clone name//functional 
domain name". When the clone has multiple hit functional domains, they are listed and demarcated by a double slash 
25 mark (//). When the clone has multiple hits of an identical functional domain, each is listed without abridgment. 

ADRGL20020290//Sodium:galactoside symporter family/VNucleoside transporter ADRGL20021910//lmmunoglobulin 
domain 

ADRGL20026790//PWWP domain 

ADRGL20036840//Class f Histocompatibility antigen, domains alpha 1 and 2 
30 ADRGL20047770//ATP synthase (F/14-kDa) subunit 
ADRGL20059610//0-Glycosyl hydrolase family 30 

ADRGL20062330//Spectrin repeat//Spectrin repeat//Bacterial flagellin N-terminus//Spectrin repeatvYSpectrin repeat// 
Spectrin repeat 

ADRGL20066770//Collagen triple helix repeat (20 copies)//C1q domain 
35 ADRGL20079060//Transglutaminase-like superfamily 

ASTRO20006530//lntermedie filament proteins//Myc leucine zipper domain 
ASTRO20010010//Photosynthetic reaction center protein 
ASTRO2001 0290//PHD-finger 

ASTRO20026320//Viral (Superfamily 1) RNA helicase//Heavy-metal-associated domain//Viral (Superfamily 1) RNA 
40 helicase 

ASTRO20038400//Homebox domain//Common central domain of tyros inase//Rhabdo virus nucleocapsid protein// 
Homeobox domain//Homeobox domain//Homeobox domain 
ASTRO20046280//MutT-like domain 

ASTRO20050810//FGGY family of carbohydrate kinases/ZFGGY family of carbohydrate kinases 
45 ASTRO20052420//RhoGEF domain//PH domain//SH3 domain 

ASTRO20053430//FERM domain (Band 4.1 family)//FERM domain (Band 4.1 family)//Delta-aminolevulinic acid dehy- 
dratase//Lipoprotein amino terminal region 
ASTRO20055570//Prion protein 

ASTRO20055930//Aldehyde oxidase and xanthine dehydrogenase, C terminus//Zinc finger, C3HC4 type (RING finger) 
50 ASTRO20085080//WD domain, G-beta repeat//Fibrillarin//WD domain, G-beta repeat//WD domain, G-beta repeat// 
WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
ASTRO20088950//Glycosyl hydrolase family 1 

BGGI120010750//Phosphoglucose isomerase//Ribosomal protein L7Ae 

BNGH420015760//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
55 G-beta repeat//WD domain, G-beta repeat 

BNGH420024870//C2 domain//C2 domain//C2 domain 

BNGH420035290//Zinc finger, C3HC4 type (RING finger)//TRAF-type zinc finger//Hr1 repeat motif//WD domain, G- 
beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta 
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repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
BNGH420036410//Arsenical pump membrane protein 
BNGH420046790//lmmunoglobulin domain 
BNGH420052350//Urotensin II 
5 BNGH420059680//NHL repeat//NHL repeat//NHL repeat//NHL repeat 

BNGH420070370//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

BNGH420074600//RNA polymerase beta subunit 

BNGH420077980//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
10 main//lmmunoglobulin domain//lmmunoglobulin domain 
BNGH420086030//PH domain//RhoGAP domain 

BRACE20003310//KRABbox//Zinc finger, C2H2type//2inc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
15 BRACE20007330//BTB/POZ domain//Kelch motif//Kelch motif//Kelch motif//Kelch motif//Kelch motif//Kelch motif 
BRACE20014450//alpha/beta hydrolase fold 
BRACE20050870//DEAD/DEAH box helicase 

BRACE20051600//Reverse transcriptase (RNA-dependent DNA polymerase) 

BRACE20051930//MAM domain. 
20 BRACE20054600//von Willebrand factor type D domain 

BRACE20055560/AWD domain, G-beta repeat/A/VD domain, G-beta repeat 

BRACE20059810//TSC-22/dip/bun family 

BRACE20061620//SPRY domain 

BRACE20065470//Ubiquitin family 
25 BRACE20069000//CLN3 protein 

BRACE20079200/A^on Willebrand factor type D domain 

BRACE20099070//FYVE zinc finger 

BRACE20196180//HMG (high mobility group) box 

BRACE20204670//Protein-tyrosine phosphatase//Dual specificity phosphatase, catalytic domain//Fatty acid desatu- 
30 rase//Protein-tyrosine phosphatase 

BRACE2021 541 0//lmidazoleglycerol-pnosphate dehydratase//UvrD/REP helicase 
BRAMY20001510//Zinc finger, C3HC4 type (RING fingerr//PHD-finger 

BRAMY20003540//PH domain//EF hand//EF hand/A/iral RNA dependent RNA polymerase//Phosphatidylinositol-spe- 

cific phospholipase C, X domain//Phosphatidylinositol-specific phospholipase C, Y domain//Bleomycin resistance pro- 
35 tein//C2 domain 

BRAMY20005080//Ubiquitin carboxyl-terminal hydrolase family 2 

BRAMY20013670//S-adenosylmethionine synthetase 

BRAMY20016780//Proprotein convertase P-domain 

BRAMY20023640/AJBX domain 
40 BRAMY20027990//C2 domain 

BRAMY20028620//Quinolinate phosphoribosyl transferase 

BRAMY20035380//Cation efflux family 

BRAMY20035830//BTB/POZ domain/ZThymidylate synthase 

BRAMY20038980//Granulocyte-macrophage colony-stimulating factor//Borrelia outer surface protein E and F 
45 BRAMY20040580//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

BRAMY20043630//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine 

Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 

Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 

BRAMY20044920//Ubiquitin carboxyl-terminal hydrolase family 2 
50 BRAMY20045420//Domain found in Dishevelled, Egl-10, and Pleckstrin 

BRAMY20051 820//C2 domain 

BRAMY20056620//Ca transferase domain 

BRAMY20056840//PWWP domain 

BRAMY20076100//Ligand-binding domain of nuclear hormone receptor 
55 BRAMY20089770//ATP P2X receptor 

BRAMY20091230//Mitochondrial carrier proteins//Mitochondrial carrier proteins 
BRAMY20094890//SURF4 family 
BRAMY201 02900//Ephrin 
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BRAMY20111780//Zinc finger, C2H2 type//Transcription factor S-ll (TF!IS)//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C3HC4 type (RING finger)//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type 

BRAMY20117670//DnaJ central domain (4 repeats)//DnaJ C terminal region 
BRAMY20118410//Phospholipase D. Active site motif 
BRAMY20118490//FGGY family of carbohydrate kinases 
BRAMY201 25360//Asparaginase 

BRAMY20134050//Nucleosome assembly protein (NAP) 
BRAMY201 43870//Peptidyl-tRNA hydrolase 

BRAMY20152510//Protein-tyrosine phosphatase//Dual specificity phosphatase, catalytic domain 
BRAMY20158550//EF hand//EF hand 

BRAMY20206340//WD domain, G-beta repeat//Dockerin domain type I//WD domain, G-beta repeat//WD domain, G- 

beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

BRAMY20227860//lnsulin-like growth factor binding proteins//Spectrin repeat 

BRAMY20234820//Ribosomal L25p family 

BRAMY20238630//TPR Domain//TPR Domain//TPR Domain 

BRAMY20244490//Adenylate kinase 

BRAMY20245140//Cyclic nucleotide-btnding domain 

BRAMY20251210//Ephrin receptor ligand binding domain//EB module 

BRAMY20263000//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
BRAMY20274510//Ribosomal protein L11 

BRAWH20014590//RNA recognition motif, (a. k. a. RRM, RED, or RNP domain) 
B RAWH20021 91 0//Carboxylesterases 

BRAWH20026010//Hepatitis C virus RNA dependent RNA polymerase 

BRAWH20039640//Leucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine Rich Repeat//Leucine rich repeat C-terminal domain// 
Leucine rich repeat N-terminal domain//Leucine Rich RepeaWLeucine Rich Repeat//Leucine Rich Repeat//Leucine 
Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine rich repeat C-terminal domain 
BRAWH20040680//Zinc finger, C3HC4 type (RING finger)//DENN (AEX-3) domain 
BRAWH20047790//HMG (high mobility group) box 
BRAWH20050740//BTB/POZ domain 

BRAWH20080580//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
DnaJ central domain (4 repeats)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
BRAWH20093040//Eukaryotic protein kinase domain 
BRAWH20094900//BNR repeat//BNR repeat//BNR repeat 

BRAWH20095900//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
BRAWH20183170//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 
BRAWH201 85260//Troponin//Serine hydroxymethyltransf erase 
BRAWH20185270//Uncharacterized protein family UPF0057 
BRAWH20188750//Glypican//ubiE/COQ5 methyltransferase family 
BRAWH201 91 980//Proline dehydrogenase 
BRCAN10001050//Celi division protein 

BRCAN2000541 0//Zinc finger, C4 type (two domains)//Zinc finger, C2HC type//SAM domain (Sterile alpha motif)// 
Sterile alpha motif (SAM)/Pointed domain 
BRCOC20000470//TPR Domain//TPR Domain 
BRSSN20005610//PDZ domain (Also known as DHR or GLGF). 
BRSSN20005660//Bacterial type II secretion system protein 

BRSSN20066440//SCAN domain//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
BRSSN20074640//Peptidase family M48 
BRSSN20093890//Kelch motif//Kelch motif 

CD34C20001750//lmmunoglobulin domain//lmmunoglobulin domain 

CTONG20004110//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Tropomyosins//Transient receptor// 
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Apolipoprotein A1/A4/E family//MutS family, N-terminat putative DNA binding domain//K-box region//Outer membrane 
efflux protein 

CTONG20004520//SH3 domain 

CTONG20007660//Caspase recruitment domain//DNA polymerase (viral) N-terminal domain//bZIP transcription fac- 
5 tor//K-box region 

CTONG20008190//ADP-ribosylation factor family//Ras family 

CTONG20017490//Sema domain//Sema domain//lntegrins, beta chain//Plexin repeat 

CTONG20020950//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
10 CTONG20029030//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeaWLeucine Rich Repeat//Leucine 
Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 

CTONG20030280//WD domain, G-beta repeat//Gram-negative pili assembly chaperone//WD domain, G-beta repeat// 

WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

CTONG20032930//Armadilto/beta-catenin-like repeats 
15 CTONG2003361 0//RyR domain 

CTONG20033750//FtsK/SpolllE family//ATPases associated with various cellular activities (AAA) 

CTONG20036990//lmmunoglobulin domain//lmmunoglobulin domain 

CTONG20044230//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

CTONG20044870//PH domain//PH domain 
20 CTONG20045500//Translation initiation factor IF-3//HC03- transporter family//HC03- transporter family//Domain of 

unknown function DUF139 

CTONG20046690//Src homology domain 2 

CTONG20049480//bZIP transcription factor//Carbamoy I -phosphate synthase (CPSase)//tRNA synthetases class I (C) 
CTONG20060040//NusB family 
25 CTONG20063770//KE2 family protein//Spectrin repeat 

CTONG20063930//SH3 domain//WW domain/AWW domain//PH domain//RhoGAP domain 
CTONG20066110//TPR Domain//TPR Domain 

CTONG20068360//Mitochondrial carrier proteins//Mitochondrial carrier proteins 
CTONG20069420//Ribosomal protein S14p/S29e 
30 CTONG20070090//Bacterial luciferase 

CTONG20070720//PH domain//RhoGAP domain//bZIP transcription factor 
CTONG20070910//PCI domain 

CTONG20071040//Beta/Gamma crystallin//Beta/Gamma crystallin//Beta/Gamma crystallin//Beta/Gamma crystallin// 
Similarity to lectin domain of ricin beta-chain, 3 copies. 
35 CTONG20071 Q&0JI3 % -5 1 exonuclease//Cytochrome c oxidase subunit MI//Ammonium Transporter Family//7 transmem- 
brane receptor (Secretin family) 

CTONG20072930//KRAB box//Ribosomal protein L20//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
40 Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Putative zinc finger in N-recognin//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
CTONG20074000//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

CTONG20074170//DENN (AEX-3) domain 
45 CTONG2007681 0//Ribosomal protein L1 6//Pheromone A receptor 

CTONG20079590//Sialyltransferase family 

CTONG20083430//Nuclear transition protein 2 

CTONG20083980/A/VH1 domain 

CTONG20084660//SCAN domain 
50 CTONG2008521 0//Lipase (class 3) 

CTONG20165750//G-patch domain//Double-stranded RNA binding motif 

CTONG20169040//bZIP transcription factor//Adenylate cyclase//! ntermediate filament proteins 

GTONG20170940//Ank repeat//Ank repeat//Ank repeat//SAM domain (Sterile alpha motif) 

CTONG201 74580//TBC domain 
55 CTONG201 76040//ADP-ribosylation factor family//Ras family 

CTONG20180690//Collagen triple helix repeat (20 copies) 

CTONG20183430//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat// 
Zinc finger, C3HC4 type (RING finger)//Zinc finger, C3HC4 type (RING finger)//Zinc finger, C3HC4 type (RING finger) 
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CTONG20184830//ABC transporter 
CTONG20186290//Aldehyde dehydrogenase family 

CTONG20186370//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type 

s CTONG20186520//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 typevVZinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-il (TFIIS)// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
CTONG20186550//BTB/POZ domain//K+ channel tetramerisation domain 

10 CTONG201 88080//TPR Domain 

CTONG20189000//RhoGEF domain 

CTONG20190290//R3H domain//RNA dependent RNA polymerase/VUncharacterized protein family UPF0024 

DFNES20025500//Sigma-54 interaction domain//ATPases associated with various cellular activities (AAA) 

DFNES20043710//Src homology domain 2//Domain of unknown function DUF36 
15 DFNES20055400//Viral coat protein//Putative diphthamide synthesis protein//! nf I uenza non-structural protein (NS1) 

DFNES20057660//Plant thionins//Mitochondrial carrier proteins//Mitochondrial carrier proteins 

DFNES20072990//lntegral membrane protein DUF6// Integral membrane protein DUF6 

DFNES20073320//Zinc finger, C3HC4 type (RING finger)//PHD-finger//B-box zinc finger. 

DFNES20076340//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
20 DFNES20080880//G!ycosyl transferases//Similarity to lectin domain of ricin beta-chain, 3 copies. 

DFNES20094820//PHD-finger//Zn-finger in ubiquitin-hydrolases and other proteins 

FCBBF10000230//Sec7 domain//PH domain 

FCBBF10004760//HMG (high mobility group) box 

FCBBF20018680//C2 domain//C2 domain 
25 FCBBF20020440//Urease 

FCBBF20023490//Helicases conserved C-terminal domain 
• FCBBF20033360//BTB/POZ domain//Kelch motif//Kelch motif//Kelch motif//Kelch motif//Kelch motif 

FCBBF20035430//AN1-like Zinc finger//AN1-like Zinc finger 

FCBBF20035490//KH domain 
30 FCBBF20041 380//SAM domain (Sterile alpha motif) 

FCBBF20043730//UBA domain 

FCBBF20059660//TPR Domain 

FCBBF20066340//PH domain 

FCBBF20070950//DNA binding domain with preference for A/T rich regions 
35 FCBBF30001100//DENN (AEX-3) domain//PPR repeat 
FCBBF30002270//linkerhistone H1 and H5 family 

FCBBF30003610//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Plant PEC family metal- 
lothionein//DM DNA binding domain//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, 
C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Phorbol esters/diacyl glycerol binding domain 
40 (C1 domain) 

FCBBF30004340//Ribosomal protein S3, C-terminal domain. //Similarity to lectin domain of ricin beta-chain, 3 copies. 
FCBBF30005360//Sigma-54 interaction domain//ATPases associated with various cellular activities (AAA)//ATPases 
associated with various cellular activities (AAA) 

FCBBF30005500//PH domain//PH domain//Putative GTP-ase activating protein for Arf 
45 FCBBF30019140// , chromo' (CHRromatin Organization Modifier) domairWchromo' (CHRromatin Organization Modifi- 
er) domai n//D E AD/D E AH box helicase//SNF2 and others N-terminal domain/ZHelicases conserved C-terminal domain 
FCBBF30019180//Armadillo/beta-catenin-like repeats//Lipoprotein amino terminal region 

FCBBF30021900//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Bo!A-like protein//Zinc finger, C2H2 type//Zinc finger, C2H2 
50 type//Snake toxin//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type FCBBF30022680//3 , -5 l ex- 
onuclease 

FCBBF30029250//SET domain 
FCBBF30035570//C2 domain 
55 FCBBF300484207/T-box//wnt family of developmental signaling proteins 

FCBBF30071500//lnfluenza RNA-dependent RNA polymerase subunit PB1//Reprolysin family propeptide//Leptin 
FCBBF30076310//Eukaryotic protein kinase domain//Eukaryotic protein kinase domain//Protein kinase C terminal do- 
main 
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FCBBF30079770//D-isomer specific 2-hydroxyacid dehydrogenases 

FCBBF30080730//RNA recognition motif, (a.k.a. RRM, RBD. or RNP domain)//Zinc knuckle 

FCBBF30085560//Pyridine nucleotide-disulphide oxidoreductase//FAD binding domain//Flavin containing amine oxi- 
dase//Phytoene dehydrogenase related enzyme 

FCBBF300931 70//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 
FCBBF30100080//Sec7 domain 

FCBBF30100120//PD2 domain (Also known as DHR or GLGF). 
FCBBF30100410//Nucleosome assembly protein (NAP) 

FCBBF3011 86707/Reprolysin (M12B) family zinc metalloprotease//Disintegrin//EB module//Hantavirus glycoprotein 
G2//Adenovirus E3 region protein CR2//jmjN domain 
FCBBF30125460//Zinc finger, C3HC4 type (RING finger) 

FCBBF30129010//KRAB box//Zinc finger, C2H2 type//GATA zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type 
FCBBF30130410//UvrB/uvrC motif 
FCBBF30132050//Galactosyltransferase//Fringe-like 

FCBBF301326607/I_eucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 
FCBBF30136230//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Cystatin domain//Homeobox domain//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
FCBBF301 42290//PHD-f inger 

FCBBF301 43550//Phosphatidylinositol-4-phosphate 5-Kinase 

FCBBF30153170//Phosphofructokinase//Phosphofructokinase//Phosphofructokinase//Phosphofructokinase 
FCBBF30161780//gag gene protein p24 (core nucleocapsid protein )//Zinc knuckle 

FCBBF30164510//Cadherin domain//Cadherin domain//Cadherin domain//Fructose-bisphosphate aldolase class-l// 

Cadherin domain//Cadherin domain//Alphaherpesvirus glycoprotein E//Cadherin cytoplasmic region 

FCBBF301 66220//Serine hydroxymethy transferase 

FCBBF30169280//PHD-finger//Zinc finger, C3HC4 type (RING finger) 

FCBBF30171230//Subtilase family//Pro protein convertase P-domain 

FCBBF301 73960//Beta-lactamase//TPR Domain 

FCBBF301 945507/Arsenical pump membrane protein//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat// 
Dehydrogenase E1 component//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat 
FCBBF30195690//C2 domain 

FCBBF301 978407/Sushi domain (SCR repeat)//CUB domain//Sushi domain (SCR repeat)//CUB domain//Sushi domain 

(SCR repeat)//Sushi domain (SCR repeat)//Sushi domain (SCR repeat) 

FCBBF3021 221 07/lmmunoglobulin domain 

FCBBF30215240//PH domain//FERM domain (Band 4.1 family) 

FCBBF30220050//Ligand-binding domain of nuclear hormone receptor 

FCBBF30222910//Corticotropin-releasing factor family 

FCBBF30236670//DEAD/DEAH box helicase 

FCBBF30250980//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

FCBBF30255680//Vesicular monoamine transporter//GGL domain//Ezrin/radixin/moesin family//Ank repeat//Ank re- 
peat 

FCBBF30257370//Gem in i virus AL1 protein//Outer membrane efflux protein 

FCBBF30259050//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger present in dys- 
trophin, CBP/p300//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//DM DNA bind- 
ing domain//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcrip- 
tion factor S-ll (TFIIS)//Zinc finger, C2H2 type//TRAF-type zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

FCBBF30260210//SNF2 and others N-termina domain//Helicases conserved C-terminal domain//Domain of unknown 
function DUF94 

FCBBF30263080//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
FCBBF30266510//Domain of unknown function DUF71 

FCBBF30271990//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat// 
Ank repeat//Ank repeat//Ank repeat//Ank repeat 
FCBBF30275590//LIM domain containing proteins 

FCBBF30282020//Ank repeat//Ank repeat//Ank repeat//K+ channel tetramerisation domain//BTB/POZ domain 
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FCBBF30285930//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//2inc finger, C2H2 type//Zinc 
finger, C2H2 type 

FCBBF50001650//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif// 
MORN motif 

FCBBF50003530//Zinc finger, C3HC4 type (RING finger) 
FCBBF50004950//Putative replicase 1 (ORF2) 

FEBRA20005040//lntermediate filament proteins//Hr1 repeat motif//Troponin//GrpE 
FEBRA20007820//DNA polymerase family B 

FEBRA20018670//Viral methyltransferase//Riboso protein 519//Alpha-2-macroglobulin family N-terminal region 
FEBRA20026820//KRABbox//Zincfinger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Putative zinc finger in N-recognin//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

FEBRA20027070//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//TRAF-type zinc f inger//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type 

FEBRA20029620//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine Rich Repeat 

FEBRA20031000//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine 
Rich Repeat/ZLeucine Rich Repeat//lnsulin/IGF/Relaxin family//Ribosomal RNA adenine dimethylases//SAM domain 
(Sterile alpha motif)//TFIIE alpha subunit//Zinc finger, C3HC4 type 
(RING finger) 

FEBRA20031280//Protein of unknown function DUF82 
FEBRA20038330//Corticotropin-releasing factor family 

FEBRA20038970//Laminin EGF-like (Domains III and V)//Pho esters/diacylglycerol binding domain (C1 domain)//EGF- 
like domain//EGF-like domain/ZTrypsin Inhibitor like cysteine rich domain//Metallothionein//EGF-like domain//EGF-like 
domain//Extracellular link domain 

FEBRA20046200//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat 
FEBRA20046510//Zincfinger, C2H2 type//Zinc finger, C2H2type//BolA-like protein//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type 

FEBRA20063720//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type 

FEBRA20078800//NADH ubiquinone oxidoreductase, 20 Kd subunit 
FEBRA20080860//Hantavirus glycoprotein G2 

FEBRA20087550//WD domain, G-beta repeat//WD domain, G-beta repeat 
FEBRA20088610//CRAL/TRIO domain. 
FEBRA20088810//Fibroblast growth factor 
FEBRA20090160//Nuclear transition protein 2 

FEBRA20090220//Nucleotidyl transferase//Bacterial transferase hexapeptide (four repeats)//Bacterial transferase 
hexapeptide (four repeats)//Bacterial transferase hexapeptide (four repeats)//Domain of unknown function DUF29// 
Peptide hormone//elF4-gamma/elFS/elF2-epsilon 

FEBRA20092760//LIM domain containing proteins//LIM domain containing proteins//LIM domain containing proteins 
FEBRA20115930//Divalent cation transported/Translation initiation factor IF-3//Divalent cation transporter 
FEBRA20150420//Transient receptor//Borrelia ORF-A//Transient receptor 

FEBRA20170240//SCAN domain//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type 

FEBRA20172230//Phospholipase D. Active site motif//Phospholipase D. Active site motif 
FEBRA20173330//Eukaryotic protein kinase domain 
FEBRA20175330//D-isomer specific 2-hydroxyacid dehydrogenases 
FEBRA201 91720//GGL domain//Regulator of G protein signaling domain 

HCHON1 00001 50//LIM domain containing proteins//LIM domain containing proteins//LIM domain containing proteins// 
Zinc finger, C3HC4 type (RING finger) 
HCHON20000870//Eukaryotic protein kinase domain 

HCHON20002650//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
HCHON20002710//Ubiquitin carboxyl-terminal hydrolase family 2 
HCHON20015050//von Willebrand factor type A domain//! PT/TIG domain//Sm protein 
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HEART10001420//MYND finger//SET domain 
HEART10001490//FAD binding domain 
HEART20009590//Peptidase family M41 

HEART20019310//Zinc finger, C3HC4 type (RING finger)//PHD-finger//B-box zinc finger. 
5 HEART20022200//lnfluenza Matrix protein (M1)//metallopeptidase family M24 

HEART20047640//3'5'-cyclic nucleotide phosphodiesterase//Formin Homology 2 Domain 

HEART20082570//C-5 cytosine-specific DNA methylase//Glycine cleavage T-protein (aminomethyl transferase) 
HHDPC20081230//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain)//RNA recognition motif, (a.k.a. RRM, 
RBD, or RNP domain)//RNA recognition motif, (a. k. a. RRM, RBD, or RNP domain)//RNA recognition motif, (a. k. a. 
w RRM, RBD, or RNP domain) 

HHDPC20088160//PDZ domain (Also known as DHR or GLGF). 
HLUNG20008460//bZIP transcription factor 
HLUNG20011260//Eukaryotic protein kinase domain 

HLUNGZOOI1460//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain)//RNA recognition motif, (a.k.a. RRM, 
15 RBD, or RNP domain) 

HLUNG20014590//Zinc finger, C2H2type//Zinc finger, C2H2 type//Putative zinc finger in N-recognin//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
HLUNG20015070//Leucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
20 RepeatZ/Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine rich repeat 
C-terminal domain//lmmunoglobulin domain 
HLUNG20024050//Rubredoxin 

HLUNG20028110//Zinc finger, C3HC4 type (RING finger)//TPR Domain//TPR Domain//TPR Domain//Zinc finger, 
C3HC4 type (RING finger)//Aldo/keto reductase family 
25 HLUNG20030420//Ank repeat//Ank repeaWAnk repeat//Ank repeat//Ank repeat 

HLUNG20032460//Pro oligopeptidase family//Lipoate-protein ligase B//alpha/beta hydrolase fold 
HLUNG20033060//HMG (high mobility group) box 

HLUNG20041590//TPR Domain//TPR Domain//Domain of unknown function DUF27 
HLUNG20042730//Cytochrome P450 
30 HLUNG20051330//FHIPEP family 

HLUNG20063700//Progesterone receptor 

HLUNG20065990//bZIP transcription factor//SNAP-25 family//Syntaxin 
HLUNG20068120//Fimbrial Usher proteins 
HLUNG20069350//EF hand//EF hand//EF hand 
35 HLUNG20070410//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
HLUNG20081390//DnaJ domain 
HLUNG20082350//PH domain//START domain 

HLUNG20083330//DNA polymerase III beta subunit//Fibrillar collagen C-terminal domain 
HLUNG20084790//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

40 HLUNG2008521 0//C2 domain 

KIDNE20016360//PAS domain//PAC mottf//lon transport protein/ZTransmembrane region cyclic Nucleotide Gated 
Channel//Cyclic nucleotide-binding domain KIDNE20027980//SAM domain (Sterile alpha motif) 
KIDNE20080690//Glycosyl hydrolases family 1 8//Aminotransferases class-Ill pyridoxal-phosphate//M ethyl-accepting 
chemotaxis protein (MCP) signaling domain//Amin ©transferases class-Ill pyridoxal-phosphate 

45 KIDNE20081170//Kinesin motor domain//Kinesin motor domain 
KIDNE20083620// Asparaginase 
KIDNE20084030//Sugar (and other) transporter 

KIDNE20084730//Neuraxin and MAPI B proteins/ZFormin Homology 2 Domain 
KIDNE20086490//gag gene protein p24 (core nucleocapsid protein) 
50 KIDNE20087880//Reverse transcriptase (RNA-dependent DNA polymerase) 
KIDNE20088240//EF hand//EF hand//EF hand 

KIDNE20089870//WD domain, G-beta repeat/AWD domain, G-beta repeat/AA/D domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

KIDNE20094670//Sigma-54 interaction domain//FtsK/SpolllE family//ATPases associated with various cellular activi- 
55 ties (AAA) 

KIDNE20133880//G-protein alpha subunit 
KIDNE20141700//Ribosomal family S4e 

KIDNE20142900//EGF-iike domain//EB module//EGF-like domain//EGF-like domain 
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KIDNE20148080//Zinc finger, C3HC4 type (RING finger) 
KIDNE20149780//Ank repeat//Ank repeat//Ank repeat 
KIDNE20152440//Trypsin//PDZ domain (Also known as DHR or GLGF). 

KIDNE20154330//PDZ domain (Also known as DHR or GLGF).//PDZ domain (Also known as DHR or GLGF).//PDZ 
5 domain (Also known as DHR or GLGF).//PDZ domain (Also known as DHR or GLGF).//PDZ domain (Also known as 
DHR or GLGF).//PDZ domain (Also known as DHR or GLGF). 
KIDNE20160360//Sec7 domain//PH domain 

KIDNE20169180//EGF-like domain//EB module//EGF-like domain//Trypsin Inhibitor like cysteine rich domain//EGF- 
like domain//TNFR/NGFR cysteine-rich region//Zona pellucida-like domain 
io KIDNE20170400//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine Rich Repeat//Protein kinase C terminal do- 
main//Rubredoxin 

KIDNE20173150//Acetyltransferase (GNAT) family 

KIDNE20173430//PDZ domain (Also known as DHR or GLGF).//PDZ domain (Also known as DHR or GLGF). 
KIDNE20186170//UDP-glucoronosyl and UDP-glucosyl transferases 
15 KIDNE20189960//Trehalase 

L1VER20006260//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Ztnc finger, C2H2 
type//Zinc finger C2H2 type//Zinc finger, C2H2 type//Plant PEC family metallothionein//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

LIVER20007750//Sugar (and other) transporter 
20 LIVER2001 0760//Lectin C-type domain 

LIVER20010990//DNA gyrase/topoisomerase IV, subunit A 

LIVER20013890/ARNA synthetases class I (C) 

LIVER20026440//Cytochrome P450 

LIVER20030650//General diffusion Gram-negative porins 
25 LIVER20038000//Mitochondrial carrier proteins//Mitochondrial carrier proteins/YMitochondrial carrier proteins 

LIVER20040740//CRAL/TRIO domain. 

LIVER20055270//AIR synthase related protein 

MESAN20006200//Annexin//Annexin 

MESAN20008150//IBR domain 
30 MESAN20009090//CUB domain 

MESAN20016270//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 

C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 

C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

MESAN20021130//SH3 domain//Eukaryotic protein kinase domain 
35 MESAN20021220//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat/YLeucine Rich Repeat//Leucine 

Rich Repeat//Leucine Rich Repeat//Leucine rich repeat C-terminal domain//Leucine rich repeat N-terminal domain// 

Leucine Rich Repeat/ZLeucine Rich RepeatZ/Leucine Rich RepeatZ/Leucine Rich Repeat//Leucine Rich Repeat/ZLeu- 

cine rich repeat C-terminal domain 

MESAN20026870//PAN domainZ/TBC domain 
40 MESAN20027240//RhoGEF domain 

MESAN20027900//von Willebrand factor type A domain//von Willebrand factor type A domain//von Willebrand factor 

type A domain//Protozoan/cyanobacterial globin//von Willebrand factor type A domain//von Willebrand factor type A 

domain 

MESAN20030350//LGN motif, putative GEF specific for G-alpha GTPase//DNA gyrase/topoisomerase IV, subunit A 
45 MESAN20030370//Porphobilinogen deaminase//GHMP kinases putative ATP -binding proteinsZ/Protein of unknown 
function DUF113 

MESAN20034440//Viral DNA-binding proteinZ/Uncharacterized protein family UPF0024 

MESAN20038520//Caspase recruitment domain//RNA polymerase beta subunit 

MESAN20057240//Elongation factor TS//Helix-hairpin-helix motif. 
50 MESAN20058110//FKBP-type peptidyl-prolyl cis-trans isomerases//Elongation factor Tu family//FKBP-type peptidyl- 

prolyl cis-trans isomerases//FKBP-type peptidyl-prolyl cis-trans isomerases 

MESAN20059570//EGF-like domain//SEA domain//lmmunoglobulin domain 

MESAN20060430//SET domain 

MESAN20067430//Tropomyosins 
55 MESAN20069530//Calponin homology (CH) domain//PDZ domain (Also known as DHR or GLGF). 

MESAN20090190//EGF-like domain//EGF-like domain//EGF-like domain/ZEGF-like domain/ZEGF-like domainZZEGF- 

like domain//Metallothionein//CUB domain 

MESAN20095220//Adaptin N terminal region 
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MESAN20095800//Cyclophilin type peptidyl-prol cis -trans isomerase//RNA recognition motif, (a.k.a. RRM, RBD, or 
RNP domain) 

NT2NE20018890//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat 
5 NT2NE20026200//Transketolase 

NT2NE20026510//Zinc finger, C3HC4 type (RING finger) 

NT2NE20038870//Zinc finger, C2H2 type//Zinc finger, C2H2 type//2inc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Lipoprotein 

NT2NE20042550//Virai (Superfamily 1) RNA helicase//NB-ARC domaiiV/Adenylate kinase//Adenylate kinase 
10 NT2NE20053950//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

NT2NE20060750//KRAB box//Zinc finger, C2H2 type 

NT2NE20061030//SCAN domain 
15 NT2NE20077270//Adenovirus EB1 55K protein / large t-antigen 

NT2NE20079670//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 

type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

NT2NE20087270//Tudor domain 

NT2NE20092950//Fibronectin type III domain 
20 NT2NE20108420//Domain of unknown function DUF130//Oxysterol-binding protein 

NT2NE20117580//NADH ubiquinone oxido reductase, 20 Kd subunit 

NT2NE201 27900//Myo-inositol-1 -phosphate synthase 

NT2NE20140130//Sema domain 

NT2NE20145250//Stathmin family 
25 NT2NE20153620//FERM domain (Band 4.1 family )//PH domain 

NT2R 12009301 0//Tetrahydrofolate dehydrogenase/cyclohydrolase 

NT2RP70001120//Ank repeat//Ank repeat//Ank repeat//lnsulinase (Peptidase family M16) 
NT2RP70001730//BTB/POZ domain//Kelch motif//Kelch motif//Kelch motif//Kelch motif//Kelch motif 
NT2RP70003110//Collagen triple helix repeat (20 copies)//Heavy-metal-associated domain 
30 NT2RP70027790//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine 
Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
Repeat 

NT2RP70029780//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-fi nger//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll 
35 (TFIIS) 

NT2RP70030840//Viral (Superfamily 1) RNA helicase 
NT2RP70031 070//Nucleoside transporter 
NT2RP70031 340// PHD -finger 

NT2RP70046410//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
40 NT2RP70056690//Thrombospondin type 1 domain//Domain of unknown function DUF1 8//Thrombospondin type 1 do- 
main//Thrombospon din type 1 domain/ZThrombospondin type 1 domain//Thrombospondin type 1 domain//Keratin, high 
sulfur B2 protein 

NT2RP70057500//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//DnaJ central do- 
^5 main (4 repeats)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

NT2RP70064570//Calpain family cysteine protease//Calpain large subunit, domain III//EF hand//EF hand 
NT2RP70075300//KRAB box//Domain of unknown function DUF19//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
50 Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

NT2RP70075800//recA bacterial DNA recombination proteins//WD domain, G-beta repeat//WD domain, G-beta re- 
peat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat// 
SH3 domain 

55 NT2RP70090870//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//DM DNA binding domain//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcrip- 
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tion factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

NTONG20005310//Ribosomal protein S9/S16 

NTONG20029850//EF hand//EF hand//EF hand 

NTONG20031580//Hsp20/alpha crystallin family 
5 NTONG200321 00//lntermediate filament proteins 

NTONG20034540//GAF domain//GAF domain//3'5'-cyclic nucleotide phosphodiesterase 

NTONG20035150//BTB/POZ domain//Kelch motif//Kelch motif//Kelch motif//Kelch motif//Kelch motif 

NTONG20043080//lmmunoglobulin domatn//lmmunoglobulin domain//lmmunoglobulin domain 

NTONG20048440//PH domain//K-box region 
10 NTONG20053630//DNA binding domain with preference for A/T rich regions//Zinc finger, C2H2 type//Zinc finger C2H2 

type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Progesterone receptor/ZZinc finger, C2H2 type 

NTONG20053730//Ribosomal protein 52//Ubiquitin carboxyl-terminal hydrolases family 2//Ubiquitin carboxy I -terminal 

hydrolase family 2 

NTONG2005391 0//Translin family//PH domain//Putative GTP-ase activating protein for AnV/Ank repeat//Ank repeat 
15 NTONG20055200//Calcium channel extracellular region//Elongation factor Tu family//Elongation factor G C-terminus 
NTONG20058010//AMP-binding enzyme 

OCBBF20000740//Leucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
RepeaV/Leucine Rich Repeat//Leucine Rich Repeat//Leucine rich repeat C-terminal domain//lmmunoglobulin domain 
OCBBF20011860//LIM domain containing proteins 
20 OCBBF20012520//Leucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine rich repeat C-terminal domain//lmmunoglobulin domain 
OCBBF20016390//Chitin synthase//Fibronectin type II domain 
OCBBF200 16810//* Paired box' domain 

OCBBF20110210//lmmunoglobulin domain/Zlmmunoglobulin domain//lmmunoglobulin domain 
25 OCBBF20113110//AP endonuclease family 1 

OCBBF20116250//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type 
OCBBF201 2001 0//Thrombospondin type 1 domain 

OCBBF20120950//K+ channel tetramerisation domain//BTB/POZ domain//Zinc finger, C2H2 type//Zinc finger, C2H2 
30 type 

OCBBF20121910//Ribosomal protein L24e 
OCBBF20147070//DNA polymerase (viral) C-terminal domain 

OCBBF20156450//KRABbox//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type 

35 OCBBF20157970//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 

finger, C2H2 type//Zinc finger, C2H2 type//TRAF-type zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//DM 

DNA binding domain//Zinc finger, C2H2 type 

OCBBF20165910//DnaB-like helicase 

OCBBF20166890//CAP-Gly domain//CAP-Gty domain 
40 OCBBF20166900//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll 

(TFUS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

OCBBF201 67290//haloacid dehalogenase-like hydrolase//Thioredoxin//AminogIycoside phosphotransferase// 
Acyl-CoA dehydrogenase 
OCBBF201 74580//Cyclin 
45 OCBBF201 74890//Death domain 

OCBBF20175360//Zinc finger, C2H2 type 

OCBBF201 77540//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Src homology 
domain 2//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
OCBBF20177910//Corticotropin-releasing factor family 

50 OCBBF201 82060//lnward rectifier potassium channel 

OCBBF20191950//EGF-like domain//Low-density lipoprotein receptor domain class A//Low-density lipoprotein recep- 
tor domain class A//EGF-like domain//Low-density lipoprotein receptor domain class A//EGF-like domain//Low-density 
lipoprotein receptor domain class A//Low-density lipoprotein receptor domain class A//EGF-like domain//EGF domain// 
Low-density lipoprotein receptor repeat class B//Low-density lipoprotein receptor repeat class B//Low-density lipopro- 

55 tein receptor repeat class B//Low-density lipoprotein receptor repeat class B//Low-density lipoprotein receptor repeat 
class B//EGF-like domain//60Kd inner membrane protein 
PANCR1 0000860//T rypsin//Trypsin 

PEBLM10001470//Zinc finger, C2H2 type//Fork head domain 
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PEBLM20001800//lmmunoglobulin domain//lmmunoglobulin domain/ZImmunoglobulin domain//lmmunoglobulin do- 
main//lmmunoglobulin domain 

PEBLM20003260//Zinc finger, C2H2 type//Zinc finger, C2H2 type//T ranscription factor S-ll (TFIIS)//Src homology do- 
main 2//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Src homology domain 2//Zinc finger, C2H2 type 
5 PEBLM20005020//Virion host shutoff protein 
PLACE50001 390//PHD-finger 
PLACE60004260//Cystatin domain 
PLACE60006300//lmmunoglobulin domain 

PLACE60012620//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat/AA/D domain, 
10 G-beta repeat//AN1 -like Zinc finger//FYVE zinc finger 

PLACE60054230//C methyl transferase// Form in Homology 2 Domain 

PLACE60054820//Regulator of chromosome condensation (RCC1)// Regulator of chromosome condensation (RCC1) 
//Regulator of chromosome condensation (RCC1)//Regulator of chromosome condensation (RCC1) 
PLACE60054870//IQ calmodulin-binding motif//IQ calmodulin-binding motif 
15 PLACE60055590//Zinc finger, C3HC4 type (RING finger) 

PLACE60061370//Phosphotyrosine interaction domain (PTB/PID)y/Extracellular link domain 
PLACE60062660//Gamma-adaptin, C-terminus 

PLACE60064180//Lumenal portion of Cytochrome b559, alpha (gene psbE) subunit.//Viral (Superfamily 1) RNA heli- 
case 

20 PLACE60066970//SCAN domain//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2type// 
Zinc finger, C2H2 type 

PLACE50070500//lmmunoglobulin domain//lmmunoglobulin domain 
PLACE60073090//Myo-inositol-1 -phosphate synthase 
PLACE60074820//Adenylate kinase 
25 PLACE60082850//Pathogenesis-related protein Bet v I family 
PLACE60087680//Thyroglobulin type-1 repeat 

PLACE60093380//Penicillin amidase//Bacterial regulatory proteins, lacl family/A/acuolar sorting protein 9 (VPS9) do- 
main 

PLACE60095600//DEAD/DEAH box helicase 
30 PLACE60098350//MAGE family 

PLACE60104fi30//Photosystem I reaction centre subunit VIII 

PLACE60113340//EGF-like domain//Laminin G domain//lnsulin-like growth factor binding proteins//EGF-like domain// 
Laminin G domain 

PLACE60118810//bZIP transcription factor//TPR Domain//TPR Domain//TPR Domain//TPR Domain//PPR repeat 
35 PLACE641 1 9700//EF hand 

PLACE60122970//Zinc finger, C2H2 type 

PLACE60138840//Syndecan domain//Mitochondrial carrier proteins//Mitochondrial carrier proteins//Mitochondrial car- 
rier proteins 

PLACE60140640//Phosphoribulokinase//Shikimate kinase/AJncharacterized protein family UPF0038 
40 PLACE60154450//RhoGAP domain 

PLACE60177880//lmmunoglobulin domain//lmmunoglobulin domain 

PLACE601 81 870//Pentaxin family 

PLACE601 84870//LBP / BPI / CETP family 

PROST1 0001 100//Ke rati n, high sulfur B2 protein 
45 PROST20007170//KRAB box//Zinc finger, C2H2 type//Cyclopropane-fatty-acyl-phosphoiipid synthase//Zinc finger, 

C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcrip- 

tion factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

PROST20015210//Myosin tail//Borrelia lipoproteins//Myosin tail 

PROST20016760// , chromo , (CHRromatin Organization Modifier) domain 
so PROST20024250//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

PROST20028970//Ank repeat//Ank repeat//Glutamine am idotransf erases class-ll//Ank repeat//Ank repeat 

PROST20033240//Ephrin receptor ligand binding domain//EB module//TNFR/NGFR cysteine-rich region//Fibronectin 

type III domain 

PROST20036350//Glutathione S-transf erases. //Ribosomal protein S24e//lnterferon alpha/beta domain/ZtRNA syn- 
55 thetases class I (E and Q) 

PROST20045700//Keratin, high sulfur B2 protein 
PROST20050390//Cytochrome P450 

PROST20051310//DEAD/DEAH box helicase/ZToprim domain//Helicases conserved C-terminal domain//Zinc knuckle 



93 



EP 1 308 459 A2 



PROST20065790//Phosphofructokinase//Phosphofructokinase//Phosphofructokinase//Phosphofmctokinase 
PROST20073280//Transposase 

PROST20075280//lmmunoglobuiin domain//lmmunoglobulin domain/ZThrombospondin type 1 domain 
PROST20082430//Cyclophilin type peptidyl-prolyl cis-trans isomerase 
PROST20084720//Cytochrome P450 

PROST20087240//gag gene protein p24 (core nucleocapsid protein) 
PROST20099090//Disintegrin 

PROST20102190//EF hand//Ribosomal RNA adenine dimethylases//EF hand 
PROST20l05450//Sodium/hydrogen exchanger family 
PROST20127450//TSC-22/dip/bun family 
PROST20130320//S-1007ICaBP type calcium binding domain 
PROST20152510//TPR Domain//TPR Domain//TPR Domain 

PROST20155370//SCAN domain//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type 
PROST20168600//KTN NAD-binding domain 
FUAEN10000650//TSC-22/dip/bun family 

PUAEN10001640//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 
PUAEN20000800//Bleomycin resistance protein 
PUAEN20001520//Acetyltransferase (GNAT) family 
PUAEN20002470//FtsK/SpolllE family 

PUAEN20003120//Myb-like DNA-binding domain//ATP synthase ab C terminal//SET domain 
SALGL10001070//Family 4 glycosyi hydrolase 
SKMUS20006790//von Willebrand factor type D domain 
SKMUS20007260//Syndecan domain 
SKMUS20008730//Calponin homology (CH) domain 
SKMUS20017400//Hantavirus nucleocapsid protein//Tropomyosins 
SKMUS20040440//Ribosomal protein L3 
SKMUS20091 900//Trypsin 

SKNMC10001230//Ank repeat//Ank repeat//Ank repeat 

SKNMC20006350//FKBP-type peptidyl-prolyl cis-trans isomerases//FKBP-type peptidyl-prolyl cis-trans isomerases// 
FKBP-type peptidyl-prolyl cis-trans isomerases//EF hand//EF hand 
SKNSH20009710//Tropomyosins//Tropomyosins 

SKNSH20052400//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
SKNSH20057920//Eukaryotic protein kinase domain 
SMINT20000070//Rabphi1in-3A effector domain 

SMINT20002320//Protein phosphatase 2A regulatory B subunit (B56 family) 

SMINT20006020//PH domain//Phor esters/diacylglycerol binding domain (C1 domain)//FYVE zinc finger//PH domain 
SMINT20006090//Glutathione S-transf erases. 
SMINT20008110//Na+/K+ ATPase C-terminus 

SM1NT2001 1 950//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//PHD-finger//Zinc finger, C2H2 type 
SMINT20012220//Collagen triple helix repeat (20 copies) 
SMINT20016150//gag gene protein p17 (matrix protein).//Ferritins 
SMINT20024140//lmmunoglobulin domain 

SMINT20028800//Zinc finger, C2HC type//SAM domain (Sterile alpha motif)//Sterile alpha motif (SAM)/Pointed domain 
SMINT20028840//lmmunoglobulin domain 

SMINT20030740//KRAB box//Zinc finger, C2H2 typeV/Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type 

SMINT20035050//SH3 domain//WW domain//PH domain//RhoGAP domain 
SMINT20036440//ENTH domain 
SMINT20038660//Olfactomedin-like domain 
SMINT20039050//Flavivirus glycoprotein//! BR domain 
SMINT20043390//Ras association (RalGDS/AF-6) domain 

SMINT20044140//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
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SMINT20044730//Envelope glycoprotein GP120 
SMINT20048720//Cytochrome P450//Cytochrome P450 
SMINT20052130//Ank repeaWAnk repeat 

SMINT20056230//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
5 main 

SM I NT20062050//Reverse transcriptase (RNA-dependent DNA polymerase) 
SMINT20077960//Gelsolin repeatV/Gelsolin repeat.//Gelsolin repeatV/Gelsolin repeat. 

SMINT20083290//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main//lmmunoglobulin domain 
10 SMINT20086250//Glycine cleavage H-protein 
SMINT20086720//SCAN domain//KRAB box 
SMINT20088440//fmmunoglobulin domain 
SMINT20089600//PDZ domain (Also known as DHR or GLGF). 

SMINT20091190//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
15 main//lmmunoglobulin domain 

SPLEN20006950//Reverse transcriptase (RNA-dependent DNA polymerase) 

SPLEN20011350//Helper component proteinase 

SPLEN2001 51 00//DHHC zinc finger domain 

SPLEN20023540//RasGEF domain//EF hand//EF hand 
20 SPLEN20023850//RecF protein//SMC domain N terminal domain//Tropomyosins 

SPLEN20024190//EGF-like domain//EB module//EGF-like domain//Trypsin Inhibitor like cysteine rich domain//EGF- 

like domain//EGF-like domain//WAP-type (Whey Acidic Protein) •four-disulfide coreV/EGF-like domain 

SPLEN20024930//lnosi polyphosphate phosphatase family, catalytic domain 

SPLEN20039180//Fatty acid desaturase 
25 SPLEN20042200//2inc finger, C3HC4 type (RLNG finger)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 

C2H2 type//Zinc finger, C2H2 type 

SPLEN20049840//Polyphosphate kinaseZ/Myosin head (motor domain)//IQ calmodulin-binding motif//SH3 domain 
SPLEN20050090//Pyridoxamine 5'-phosphate oxidase//GRIP domain 
SPLEN20054500//Renal dipeptidase 
30 SPLEN20055600//K+ channel tetramerisation domain//BTB/POZ domain//Zinc finger, C2H2 type 

SPLEN20057900//Scorpion short toxins//EGF-like domain//EGF-like domaiiV/Keratin, high sulfur B2 protein 
SPLEN20059270//Zinc finger, C3HC4 type (RING finger)//B-box zinc finger.//SPRY domain 
SPLEN20063250//Zinc finger, C2H2 type 

SPLEN20063890//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine 
35 Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine Rich Repeat//Leucine Rich Repeat 

SPLEN20071 820//DNA polymerase X family 

SPLEN20076470//TPR Domain//TPR Domain//TPR Domain 

SPLEN20080070//Alpha-L-fucosidase 

SPLEN20085910//Double-stranded RNA binding motif 
40 SPLEN20090880//lmmunoglobulin domain 

SPLEN20098030//Zinc finger, C3HC4 type (RING finger)//B-box zinc finger. 

SPLEN201 01 950//Sodium/hydrogen exchanger family 

SPLEN20104150//Ribosomal protein L36 

SPLEN20108000//short chain dehydrogenase 
45 SPLEN201 1 01 80//Transposase 

SPLEN20118050//Leucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 

Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeaV/Leucine Rich Repeat//Leucine Rich Repeat 

SPLEN20135030//PDZ domain (Also known as DHR or GLGF).//GATA zinc finger//LIM domain containing proteins 

SPLEN20139100//Sodium and potassium ATPases//fmmunog!obulin domain//lmmunoglobulin domain//lmmunoglob- 
50 ulin domain//lmmunoglobulin domain 

SPLEN20139360//Bacterial regulatory proteins, lacl family//Site-specific recombinases 

SPLEN20175920//Uncharacterized protein family UPF0036 

SPLEN201 80980//Glutathione S-transferases. 

SPLEN20182990//BTB/POZ domain//Kelch motif//Kelch motif//Ketch motif//Kelch motif 
55 SPLEN201 83020//Laminin G domain 

SPLEN20191020//Src homology domain 2 

SPLEN20193790//Dynamin family//Dynamin family//Proteasome activator pa28 beta subunit//Peroxidase//Dynamin 
central region//Dynamin GTPase effector domain SPLEN2019571 0//TPR Domain/ATPR Domain//TPR Domain 
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SPLEN20197930//K-box region 

SPLEN201 98390//WD domain, G-beta repeat//Fibrillarin//WD domain, G-beta repeat/A/VD domain, G-beta repeat/A/VD 
domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
SPLEN20201830//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine 
Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 
TESTI10001570//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//TRAF- 
type zinc finger/ZZinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Tran- 
scription factor S-lt (TFIIS)//Zinc finger, C2H2 type 

TESTI20006160//Large-conductance mechanosensitive channel, MscU/CbiM 
TESTI20006830//PWWP domain 
TESTI20012080//Chitin synthase 
TESTI20016970//TPR Domain 

TESTI20030200//Double-stranded RNA binding motif//Adenosine-deaminase (editase) domain//Adenosine-deami- 
nase (editase) domain 

TESTI20030440//Plant PEC family metallothionein//Tropomyosins 
TESTI20031310//Serpins (serine protease inhibitors) 
TESTI20038240//Peptidase family M13 

TE STI 2004 1630//O uter membrane efflux protein//lntermediate filament proteins 

TESTI20043910//IQ calmodulin-binding motif//IQ calmodulin-binding motif//lQ calmodulin-binding motif//IQ calmodu- 
lin-binding motif//IQ calmodulin-binding motif 

TESTI20045390//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobu!in do- 
main//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain 
TESTI20046110//Extracellular link domain 

TESTI20046490//LIM domain containing proteins//Somatomedin B domain 

TESTI20046870//CRAL/TRIO domainV/CRAL/TRIO domain.//MSP (Major sperm protein) domain 
TESTI20046890//PHD-finger 
TESTI20049060//lmmunoglobulin domain 
TESTI20049410//Proprotein convertase P-domain 

TESTI20050720//Coenzyme A transferase//Alpha-2-macroglobulin family N -terminal region//Coenzyme A transferase 

TESTI20051730//Glutamine am idotransf erases cl ass- 1 l//alp ha/beta hydrolase fold 

TESTI20053070//WD domain, G-beta repeat/A/VD domain, G-beta repeat 

TESTI20053950//IQ calmodulin-binding motif 

TESTI20055880//Serum amyloid A protein 

TESTI20056030//Myosin tail 

TESTI20057430//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 
type 

TESTI20057590//Leucine rich repeat C-terminal domain//lmmunoglobulin domain 
TESTI20057840//SAP domain//Zinc knuckle//Zinc finger, C3HC4 type (RING finger) 
TESTI20057880//Zinc finger, C3HC4 type (RING finger) 

TESTI20058350//Polyomavirus coat protein//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G- 
beta repeat//WD domain, G-beta repeat/A/VD domain, G-beta repeat/A/VD domain, G-beta repeat 
TESTI20058920//Tubulin/FtsZ family 

TESTI20059080//Thermophilic metalloprotease (M29)//Hyaluronidase 
TESTI20059480//Cyclic nucleotide-binding domain 

TESTI2005981 01/KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc fin- 
ger, C2H2 type//DM DNA binding domain//Zinc finger, C2H2 type//Zinc finger, C2H2 type//BolA-like protein//Zinc finger, 
C2H2 type//Coronavirus M matrix/g ly cop rote in//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TESTI20060350//DZIP transcription factor 

TESTI20060830/IZAP domain//Ferric uptake regulator f am ily//Peptidase family M1//Piwi domain 
TESTI20061090//Keratin, high sulfur B2 protein 
TESTI20061200//Sugar (and other) transporter 

TESTI20064370//TPR Domain//TPR Domain//TPR Domain//TPR Domain//Synaptobrevin 

TESTI20064530//WD domain, G-beta repeat/A/VD domain, G-beta repeat//WD domain, G-beta repeat/A/VD domain, 
G-beta repeat//WD domain, G-beta repeat/A/VD domain, G-beta repeat 
TESTI20064650//Myosin head (motor domain) 
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TESTI20065650//G-protein alpha subunit 

TESTI20066150//Picornavirus 2B protein//Glutamine amidotransf erase class- l//Pancreatic hormone peptides 
TESTI20066330//Flbronectin type III domain 
TESTI20066650//RasGEF domain 
5 TESTI20067480//KRAB box//Zinc finger, C2H2 type//TRAF-type zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//FYVE zinc f inger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2type 

TESTI20068530//Zinc finger, C3HC4 type (RING finger)//PHD-finger 
10 TESTI200711 30//ATP synthase Alpha chain, C terminal 

TESTI20071630//Glutamine synthetase//SCP-like extracellular protein 

TESTI20075240//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Putative zinc finger in N-recognin//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 
is type 

TEST120076570//Dual specificity phosphatase, catalytic domain 

TESTI20079220//KRAB box//Myb-like DNA-binding dornain//Myb-like DNA-binding domain//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type 
20 TESTI20079980//PDZ domain (Also known as DHR or GLGF) .//Domain found in Dishevelled, Egl-10, and Pleckstrin 
TESTI20081890//PDZ domain (Also known as DHR or GLGF). 
TESTI20084250//Oxysterol-binding protein 
TESTI20086840//Von Willebrand factor type A domain 

TESTI20088840//Zinc finger, C3HC4 type (RING finger)//PHD-finger//Thymidine kinases//E7 protein, Early protein// 
25 CONSTANTS family zinc finger//B-box zinc fingerV/SPRY domain 
TESTI20092170//ENV polyprotein (coat polyprotein) 
TESTI20095200//7TM chemoreceptor 

TESTI20095770//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
30 TESTI20095880//Domain of unknown function DUF33//Peniciilin amidase//Formate/nitrite transporter//Sodium:galac- 
toside symporter family TESTi20099350//GGL domain//Clusterin//Biopterin-dependent aromatic amino acid hydroxy- 
lase 

TESTI20100090//Lectin C-type domain 

TESTI20104090//TEA domain 
35 TE STI 201 0591 0//Notch (DSL) domain//Amiloride-sensitive sodium channel 

TESTI20106170//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 

G-beta repeat//WD domain, G-beta repeat 

TESTI20106820//Protein kinase C terminal domain 

TESTI20108060//Ser/Thr protein phosphatase 
40 TESTI201 1 2540//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Chorismate 

synthase//UvrB/uvrC motif 

TESTI20112860//Eukaryotic protein kinase domain 
TESTI20113940//Divalent cation transporter 

TESTI20114480//Zinc finger, C4type (two domains)//Zinc finger, C2HCtype//SAM domain (Sterile alpha motif)//Sterile 
45 alpha motif (SAM)/Pointed domain 

TESTI20116050//UBX domain 

TESTI20120500//Kelch motif//Kelch motif 

TESTI20120900//DNA gyrase/topoisomerase IV, subunit A 

TESTI20121040//Protein phosphatase 2C//Protein phosphatase 2C 
so TESTI201 22070//ELM2 domain//Myb-like DNA-binding domain 

TESTI20125280//lmmunoglobulin domain//lmmunoglobulin domain 

TESTI20125920//PCI domain 

TESTI20126280//recA bacterial DNA recombination proteins 
TESTI20131440//Carboxypeptidase activation peptide//Zinc carboxypeptidase 
55 TESTI20132310//Ubiquitin carboxyl-terminal hydrolase family 2 

TESTI20134680//MYND finger//B-box zinc finger.//CONSTANS family zinc finger//Putative zinc finger in N-recognin 
TESTI20134970//Double-stranded RNA binding motif//Aldehyde oxidase and xanthine dehydrogenase, C terminus// 
Adenosine-deaminase (editase) domain 
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TESTI201 40970//lmmunoglobulin domain 
TESTI20145780//Src homology domain 2 

TESTI20148380//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Domain//PPR repeat// 
Neuraminidases//ATP synthase Alpha chain, Cterminal//TPR Domain//TPR Domain//TPR Domain//TPR Domain//PPR 
5 repeat//TPR Domain 

TESTI20150420//RhoGAP domain 

TESTI20150920//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine 
Rich Repeat 

TE STI 201 5331 0//lntermediate filament tail domain 
io TESTI201 62780//Divalent cation transporter 

TESTI20162980//Ubiquitin family//Retroviral aspartyl protease//Retroviral aspartyl protease 

TESTI20164210//lsocitrate and isopropylmalate dehydrogenases//Ribosomal protein S27a//TILa domain//von Wille- 
brand factor type C domain 
TESTI20165990//Ribosomal protein L36 
'5 TESTI201 66290//Zinc finger, C2H2 type//FAD binding domain//Phosphoenolpyruvate carboxykinase//Ribosomal pro- 
tein S11 

TESTI20166670//Zinc finger C-x8-C-x5-C-x3-H type (and similar). 
TESTI20169500//GGL domain 
TESTI20170280//Flagellar L-ring protein 
20 TESTI201 73960//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

TESTI20179230//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
TESTI20182760//Amiloride-sensitive sodium channel 

25 TESTI20183680//Gas vesicles protein GVPc repeated domain 

TESTI201 84750//Laminin G domain//Thrombospondin N-terminal -like domains//Laminin G domain 
TESTI201 84760//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Transcription 
factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TESTI20186110//Divalent cation transporter/ZTranslation initiation factor IF-3//Divalent cation transporter 

30 TESTI201 93080//Growth-Arrest-Specif ic Protein 2 Domain 
TESTI20194880//SAP domain 

TESTI20196690//Glycine cleavage T-protein (aminomethyl transferase) 
TESTI20197030//Pancreatic hormone peptides 

TESTI20197600//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
35 TESTI201 991 1 0//Disintegrin 

TESTI20205100//DNA gyraseAopoisomerase IV, subunit A 

TESTI20205250//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif 
TESTI20207170//Nucleosome assembly protein (NAP) 
TESTI20210570//CRAUTR1O domain. 
40 TESTI20212970//DEAD/DEAH box helicase//Helicases conserved C-terminal domain 
TESTI20219110//Eukaryotic protein kinase domain 

TESTI20222030//Hemagglutinin//ATP synthase Alpha chain, C terminal//AMP-binding enzyme 
TESTI20222460//lntermediate filament proteins 

TESTI20227380//DEAD/DEAH box helicase//Helicases conserved C-terminal domain 
45 TESTI202281 20//RhoGAP domain 

TESTI20228740//Zinc finger, C2H2 type 

TESTI20244220//Cecropin family//Fes/CIP4 homology domain//Hr1 repeat motif//SH3 domain 
TESTI20244430//Ank repeat//Ank repeat//Ank repeat//SAM domain (Sterile alpha motif) 

TESTI20244460//pKID domain//Adenylate kinase/ZThymidylate kinase//ATPases associated with various cellular ac- 
50 tivities (AAA) 

TESTI20246480//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 
TESTI20251610//Chitin synthase//von Willebrand factor type A domain 
TESTI20252690//Domain found in Dishevelled, Egl-10, and Pleckstrin 

TESTI20254030//LIM domain containing proteins//LIM domain containing proteins//Villin headpiece domain 
55 TESTI20254990//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TESTI20255460//ZAP domain//Piwi domain 

TE STI 202579 10//C lass I Histocompatibility antigen, domains alpha 1 and 2//lmmunoglobulin domain 
TESTI20258720//Ank repeat//Ank repeat//Ank repeat 
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TESTI202591 101 /Zinc finger, C3HC4 type (RING finger)//PHD-finger 
TESTI20261040//DNA polymerase (viral) C-terminal domain 
TESTI20261 1 60//PH domain 

TESTI20261680//Hsp20/alpha crystallin family//Granulins 
5 TESTI20262150//lon transport protein 
TESTI20262940//Phosphofructokinase 

TESTI20264530//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//C.elegans integral membrane protein Srb//WD domain, G-beta repeat// 
WD domain, G-beta repeat//SAM domain (Sterile alpha motif)//Sterile alpha motif (SAM)/Pointed domain 
10 TESTI2026491 0//Uteroglobin family 

TESTI20266050//Zinc finger, C3HC4 type (RING finger)//SPRY domain 

TESTI20274960//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//T ranscription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

15 TESTl20278280//PMP-22/EMP/MP20/Claudin family 

TESTI20282530//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TESTI20284260//Histone-like transcription factor (CBF/NF-Y) and archaeal histone//Core histone H2A/H2B/H3/H4 
TESTI20285230//Adenosine-deaminase (editase) domain 

20 THYMU10004280//NHL repeat//NHL repeat 

THYMU20006020//lsocitrate and isopropylmalate dehydrogenases 
THYMU20009500//TPR Domain 

THYMU20013250//LIM domain containing proteins//Rl01/ZK632.3/MJ0444 family//Eukaryotic protein kinase domain 
THYMU20018250//TPR Domain 
25 THYMU2001 9260//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
THYMU20021090//SAM domain (Sterile alpha motif)//Sterile alpha motif (SAM)/Pointed domain 
THYMU20028150//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main 

THYMU20028410//BRCA1 C Terminus (BRCT) domain//BRCA1 C Terminus (BRCT) domain 
30 THYMU20031330//4Fe-4S iron sulfur cluster binding proteins, NifH/frxC family 

THYMU20032820//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

THYMU20039320//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat 

35 THYMU20046350//Cytochrome C and Qui no I oxidase polypeptide I 

THYMU20049060//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 

THYMU20052830//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main 

THYMU20055450//Zona pellucida-like domain 
40 THYMU20055460//Putative esterase 

THYMU20055760//Na+/K+ ATPase C-terminus//Phospholipase A2 
THYMU20062770//Zona pellucida-like domain 

THYMU20063650//Ribulose-phosphate 3 epimerase family//lndole-3-glycerol phosphate synthases 
TH YM U20066660//D E AD/D E AH box helicase 
45 THYMU20070250//Transketolase//Dehydrogenase E1 component/ZTransketolase 

THYMU20071120//KRABbox//Zinc finger, C2H2type//Zincfinger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type 

50 THYMU20077250//HMG (high mobility group) box 
THYMU20081110//LIM domain containing proteins 
THYMU20083390//11S plant seed storage protein 

THYMU20090230//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 
THYMU20095920//lron hydrogenase small subunit 
55 THYMU20097920//PH domain//FERM domain (Band 4.1 family) 

THYMU20098350//bZIP transcription facto r//bZIP transcription factor//Tubulin/FtsZ family//lntermediate filament pro- 
teins 

THYMU20099060//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
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main//lmmunoglobulin domain 

THYMU20100940//Protein of unknown function DUF132 

THYMU20104480//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 
THYMU20106990//PH domain 
5 THYMU20110720//Aminotransf erases class-Ill pyridoxal-phosphate 

THYMU20112590//Acyl-CoA dehydrogenase//Adaptin N terminal region 

THYMU20120240//Ubiquitin carboxyl-terminal hydrolases family 2//Ubiquitin carboxyl-termina! hydrolase family 2 
THYMU20120730//Aldehyde dehydrogenase family 

THYMU20121040//bZIP transcription factor//EF-1 guanine nucleotide exchange domain 
10 THYMU201 391 60/AJncharacterized protein family UPF0031 

THYMU20143230//EGF-like domain//Extracellular link domain/ZFasciclin domain 
THYMU20145990//SH3 domain 

THYMU20153210//7 transmembrane receptor (Secretin family) 
THYMU201 70230//Glycine cleavage T-protein (aminomethyl transferase) 
15 THYMU20176010//WD domain, G-beta repeat//PQQ enzyme repeat//WD domain, G-beta repeat//WD domain, G-beta 
repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat/A/VD domain, G-beta re- 
peat//WD domain, G-beta repeat 

THYMU201784407/lmmunog1obulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main 

20 THYMU20184550//HSF-type DNA-binding domain//bZIP transcription factor 

THYMU20191970//Cadherin domain//Cadherin domain//Cadherin domain//Cadherin domain 
TKIDN10000620//Thioredoxin 

TKIDN10001920//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2 H 2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
25 TRACH20011010//5 , -nucleotidase 

TRACH20021380//Copper/zinc superoxide dismutase (SODC)//Adenylate and Guanylate cyclase catalytic domain// 
Adenylate and Guanylate cyclase catalytic domain 

TRACH20029880//MORN motif//MORN motif//Penicillin amidase//Bacterial regulatory proteins, lac I family/A/acuolar 

sorting protein 9 (VPS9) domain 
30 TRACH20040390//Pumilio-family RNA binding domains (aka PUM-HD, Pumilio homology domain)//Pumilio-family 

RNA binding domains (aka PUM-HD, Pumilio homology domain)//Pumilio-family RNA binding domains (aka PUM-HD, 

Pumilio homology domain)//Pumilio-family RNA binding domains (aka PUM-HD, Pumilio homology domain)//Pumilio- 

family RNA binding domains (aka PUM-HD, Pumilio homology domain)//Pumilio-family RNA binding domains (aka 

PUM-HD, Pumilio homology domain)//Pumilio-family RNA binding domains (aka PUM-HD, Pumilio homology domain) 
35 //Pumilio-family RNA binding domains (aka PUM-HD, Pumilio homology domain) 

TRACH20043360//Kinesin motor domain//Caspase recruitment domain//Ribosomai protein L35 

TRACH20058000//Fibronectin type III domain 

TRACH20090060//C2 domain 

TRACH20091070//Aldehyde dehydrogenase family 
40 TRACH20093400//Adaptin N terminal region 

TRACH2009851 0//Ribosomal L29 protein 

TRACH20104510//Uncharacterized protein family UPF0005 

TRACH20108240//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain)//RNA recognition motif, (a.k.a. RRM, 
RBD, or RNP domain) 
45 TRACH201 1 3020//AI R synthase related protein 

TRACH20122980//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Domain 
TRACH20131230//PH domain//Oxysterol-binding protein 
TRACH20139280//PX domain 

TRACH201437107/Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 
50 TRACH20149500//Zinc finger//Plexin repeat//Cysteine rich repeat//lntermediate filament proteins 
TRACH20149740//Sodium:dicarboxylate symporter family 

TRACH20163470//Putative integral membrane protein DUF46//Sugar (and other) transporter//Sodium:galactoside 
symporter family 

TRACH20164100//Retroviral aspartyl protease 
55 TRACH201 6481 0//D- isomer specific 2-hydroxyacid dehydrogenases 
TRACH20167090//Chitinases, family 2 

TRACH20170860//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main 
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TRACH201 88350//Tropomyosins 
TRACH20190460//Lipase (class 3) 

UTERU20000950//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat/A/VD domain, G-beta repeat 
5 UTERU20016580//Zinc finger present in dystrophin, CBP/p300//Myb-like DNA-binding domain 

UTERU20026620//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

UTERU20041630//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Transcription factor S-li (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
10 UTERU20083020//Domain of unknown function DUF71 

UTERU20086530//Lipocalin / cytosolic fatty-acid binding protein family 
UTERU20087070//Sushi domain (SCR repeat)//Trypsin 

UTERU20089390//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Do- 
main//TPR Domain 
15 UTERU20089620//Fringe-like 

UTERU20099040//Cation efflux family 

UTERU20099510//KRABbox//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//PHD-finger//Zinc finger, C2H2 type 
20 UTERU201 0431 0//RN A polymerases K/ 14 to 18 kDa subunit 
UTERU20121140//Rhodanese-like domain 

UTERU20122520//FERM domain (Band 4.1 family)//FERM domain (Band 4.1 family) 
UTERU20127030//Xylose isomerase//ApbE family 

UTERU20127150//Translation initiation factor IF-3//Divalent cation transporter 
25 UTERU201 285607/Domain of unknown function DUF28 
UTERU20132620//HMG14 and HMG17 

UTERU20139760//Mrtochondrial carrier proteins/ZMitochondrial carrier proteins 
UTERU20168960//PH domain//Methanol dehydrogenase beta subunit 
UTERU20181270//Zinc knuckle 
30 UTERU201 85220//Bromodomain 

EXAMPLE 6 

Functional categorization based on the full-length nucleotide sequences 

35 

[0224] The functional prediction and categorization of the proteins encoded by the clones were carried out based 
on the result of homology search of the databases of GenBank, Swiss-Prot, UniGene and nr (see the Homology Search 
Result Data) for the full-length nucleotide sequences and the result of domain search of the amino acid sequences 
deduced from the full-length nucleotide sequences (see Example 5). 

40 [0225] The clone predicted to belong to the category of secretory protein/membrane protein means a clone having 
hit data with some annotation, such as growth factor, cytokine, hormone, signal, transmembrane, membrane, extra- 
cellular matrix, receptor, G-protein coupled receptor, ionic channel, voltage-gated channel, calcium channel, cell ad- 
hesion, collagen, connective tissue, etc., suggesting that it is a secretory or membrane protein, or means a clone in 
which the presence of nucleotide sequence encoding a signal sequence or transmembrane domain was suggested 

45 by the results of PSORT and SOSUI analyses for deduced ORF. 

[0226] The clone predicted to belong to the category of glycoprotein -related protein means a clone having hit data 
with some annotation, such as glycoprotein, suggesting that the clone encodes a glycoprotein-related protein. 
[0227] The clone predicted to belong to the category of signal transduction-related protein means a clone having hit 
data with some annotation, such as serineAhreonine-protein kinase, tyrosine-protein kinase, SH3 domain, SH2 domain, 

50 etc., suggesting that the clone encodes a signal transduction-related protein. 

[0228] The clone predicted to belong to the category of transcription-related proteinemeans a clone having hit data 
with some annotation, such as transcription regulation, zinc finger, homeobox, etc., suggesting that the clone encodes 
a transcription- related protein. 

[0229] The clone predicted to belong to the category of disease-related protein means a clone having hit data with 
55 some annotation, such as disease mutation, syndrome, etc., suggesting that the clone encodes a disease-related 
protein, or means a clone whose full-length nucleotide sequence has hit data for Swiss-Prot, GenBank, or UniGene, 
where the hit data corresponds to genes or proteins which have been deposited in the Online Mendelian Inheritance 
in Man (OMIM) (http://www.ncbi.nlm.nih.gov/Omim/), which is the human gene and disease database. 
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[0230] The clone predicted to belong to the category of enzyme and/or metabolism-related protein means a clone 
having hit data with some annotation, such as metabolism, oxidoreductase, E. C. No. (Enzyme commission number), 
etc., suggesting that the clone encodes an enzyme and/or metabolism-related protein. 

[0231] The clone predicted to belong to the category of cell division and/or cell proliferation-related protein means 
5 a clone having hit data with some annotation, such as cell division, cell cycle, mitosis, chromosomal protein, cell growth, 

apoptosis, etc., suggesting that the clone encodes a cell division and/or cell proliferation-related protein. 

[0232] The clone predicted to belong to the category of cytoskeleton -related protein means a clone having hit data 

with some annotation, such as structural protein, cytoskeleton, actin-binding, microtubles, etc., suggesting that the 

clone encodes a cytoskeleton-related protein. 
10 [0233] The clone which is predicted to belong to the category of nuclear protein and/or RN A synthesis-related protein 

means a clone having hit data with some annotation, such as nuclear protein, RNA splicing, RNA processing, RNA 

helicase, polyadenylation, etc., suggesting that the clone encodes a nuclear protein and/or RNA synthesis- related 

protein. 

[0234] The clone predicted to belong to the category of protein synthesis and/or transport-related protein means a 
15 clone having hit data with some annotation, such as translation regulation, protein biosynthesis, amino-acid biosyn- 
thesis, ribosomal protein, protein transport, signal recognition particle, etc., suggesting that the clone encodes a protein 
synthesis and/or transport- related protein. 

[0235] The clone predicted to belong to the category of cellular defense- related protein means a clone having hit 
data with some annotation, such as heat shock, DNA repair, DNA damage, etc., suggesting that the clone encodes a 
20 cellular defense- related protein. 

[0236] The clone predicted to belong to the category of development and/or differentiation-related proteins means 
a clone having hit data with some annotation, such as developmental protein, etc., suggesting that the clone encodes 
a development and/or differentiation-related protein. 

[0237] The clone predicted to belong to the category of DNA-binding and/or RNA-binding protein means a clone 
25 having hit data with some annotation, such as DNA-binding, RNA-binding, etc. 

[0238] The clone predicted to belong to the category of ATP-binding and/or GTP-bihding protein means a clone 
having hit data with some annotation, such as ATP-binding, GTP-binding, etc. 

[0239] In this functional categorization, when a single clone corresponded to multiple categories of those shown 
above, the clone was assigned to the multiple categories. However, the function of a protein is not restricted to the 
30 functional category in this classification, and there is the possibility that other functions are newly assigned to the protein. 
[0240] The clones predicted to belong to the category of secretory protein and/or membrane protein are the following 
516 clones. 

ADRGL20020290, ADRGL20021 91 0, ADRGL20036380, ADRGL20036840, ADRGL2005961 0, ADRGL20063770, 
ADRGL20066770, ASTRO2001 001 0, ASTRO20020240, ASTRO20045840, ASTRO20053430, ASTRO20055530, 

35 ASTRO20055570, ASTRO20055930, ASTRO20088950, ASTRO20091180, BNGH420021680, BNGH420023870, 
BNGH420046790, BNGH420052350, BNGH420059680, BNGH4200 75940, BNGH420077980, BRACE1 0000510, 
BRACE20051930, BRACE20052530, BRACE20054080, BRACE20066360, BRACE20068710, BRACE20069000, 
BRACE20069110, B R AC E20 194670, BRACE20204670, BRACE2021 6950, BRAMY1 0001 730, BRAMY20003880, 
B RAM Y2001 3670, BRAMY20024790, BRAMY20027390, BRAMY20028530, BRAMY20035380, BRAMY20044920, 

40 BRAMY20045210, BRAMY20047560, BRAMY20050940, BRAMY2005391 0, BRAMY20055760, BRAMY200 72440, 
BRAMY20083820, BRAMY20089770, BRAMY20091230, BRAMY20094890, BRAMY20096930, BRAMY201 18410, 
BRAMY201 23400, BRAMY20125550, B R AM Y20 12731 0, BRAMY201 27760, BRAMY201 35720, BRAMY201 37360, 
BRAMY201 39440, BRAMY20152510, BRAMY201 94680, BRAMY20204270, BRAMY20225320, BRAMY202371 90, 
BRAMY20245140, BRAMY20251750, BRAMY20285650, BRAWH20020470, BRAWH2002191 0, BRAWH20026010, 

45 BRAWH20030000, BRAWH20039640, BRAWH 2 0055330, B R AW H20 078620, BRAWH20093070, BRAWH201 85270, 
BRCAN 10000760, BRCAN1 0001 680, BRCAN20001480, BRCAN200041 80, BRCAN20005230, BRCOC20000470, 
BRCOC20003600, BRHIP1 0000720, BRHIP1 0001 040, BRHIP20000210, BRSSN20001 970, BRSSN20074640, 
BRSSN20091190, CD34C20001750, CTONG2001 7490, CTONG20036990, CTONG20041260, CTONG20044870, 
CTONG20045500, CTONG20049480, CTONG20051450, CTONG20055850, CTONG200561 50, CTONG20059130, 

50 CTONG20060040, CTONG20063770, CTONG20065680, CTONG20068360, CTONG20069320, CTONG20071 680, 
CTONG20076810, CTONG 20078340, CTONG20079590, CTONG20083980, CTONG20084020, CTONG20085210, 
CTONG201 67750, CTONG201 68240, CTONG201 79890, CTONG201 83830, CTONG20 184830, DFNES2001 8000, 
DFNES20029660, DFNES20057660, DFNES20072990, DFNES20080880, FCBBF20018680, FCBBF20029280, 
FCBBF20032930, FCBBF20036360, FCBBF20054390, FCBBF30004340, FCBBF30022680, FCBBF30029250, 

55 FCBBF30042610, FCBBF30062490, FCBBF30075970, FCBBF30078600, FCBBF30091520, FCBBF3009541 0, 
FCBBF301 05440, FCBBF301 1 8670, FCBBF301 32660, FCBBF301 35890, FCBBF301 45670, FCBBF301 6451 0, 
FCBBF301 69870, FCBBF301 71230, FCBBF301 72330, FCBBF301 77290, FCBBF301 79740, FCBBF301 95690, 
FCBBF301 97840, FCBBF3021221 0, FCBBF30223110, FCBBF3022321 0, FCBBF30225930, FCBBF3023061 0, 
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FCBBF30260480, FCBBF30266510, FCBBF30287940, FCBBF5000061 0, FCBBF50004950, 
FEBRA20018670, FEBRA20031280, FEBRA20031 81 0, FEBRA20038220, FEBRA20039260, 
FEBRA20040560, FEBRA20046280, FEBRA20080860, FEBRA20084750, FEBRA2008881 0, 
FEBRA201 16650, FEBRA20121950, FEBRA20141980, FEBRA201 77800, FEBRA201 82030, 
HCHON1 0001 660, HCHON20015050, HEART1 0001490, HEART20031680, HHDPC10001140, 
HHDPC20082790, HHDPC20088160, HLUNG20015070, HLUNG20015180, HLUNG20020850, 
HLUNG20032460, HLUNG20033350, HLUNG20034970, HLUNG200371 60, HLUNG20041540, 
HLUNG20050760, HLUNG20052300, HLUNG20060670, HLUNG20065990, HLUNG20074330, 
HLUNG20088750, HLUNG20092530, KIDNE2001 6360, KIDNE200831 50, KIDNE20084030, 
KIDNE20084800, KIDNE20086490, KIDNE20086660, KIDNE20094670, KIDNE20134130, 
KIDNE20143200, KIDNE20148080, KIDNE201 60960, KIDNE201 63710, KIDNE201 691 80, 
KIDNE201 861 70, KIDNE201 88630, KIDNE201 89960, LIVER20007750, LIVER2001 051 0, 
LIVER20026440, LIVER20030650, LIVER20038000, MESAN20007110, MESAN20008150, 
MESAN20027900, MESAN20058110, MESAN20059570, MESAN20060430, MESAN20067430, 
MESAN20095220, NT2NE2 001 8740, NT2NE20021 860, NT2NE20039210, NT2NE20053230, 
NT2NE20064780, NT2NE20069580, NT2NE20080770, NT2NE200821 30, NT2NE20092950, 
NT2NE201 45250, NT2NE201 4651 0, NT2NE201 52620, NT2NE201 67660, NT2NE201 81 800, 
NT2R 120021 200, NT2RI20033920, NT2RP70003110, NT2RP70027790, NT2RP70031070, 
NT2RP70056690, NT2RP700871 40, NTONG20034540, NTONG20053630, OCBBF20000740, 
OCBBF201 09780, OCBBF2011 021 0, OCBBF2011 0730, OCBBF201 12280, OCBBF2011 8720, 
OCBBF201 23200, OCBBF201 55030, OCBBF20155900, OCBBF201 6591 0, OCBBF201 70350, 
OCBBF201 85630, OCBBF20191 950, PANCR1 0000860, PEBLM20001800, PLACE50001290, 
PLACE60006300, P LAC E6 0053280, PLACE60055590, PLACE60056910, PLACE60057860, 
PLACE60064740, PLACE60070500, PLACE60087680, PLACE 601 04630, PLACE601 0701 0, 
PLACE601 38840, PLACE601 54450, PLACE601 84870, PROST1 0001 100, PROST20011160, 
PROST20035830, PROST20045700, PROST20050390, PROST200651 00, PROST20073280, 
PROST20084680, PROST20 084720, PROST20099090, PROST201 05450, PROST20206060, 
PROST20110120, PROST20114100, PROST20 146590, PROST2015251 0, PROST201 68600, 
SKMUS20006790, SKMUS20020770, SKMUS20073150, SKMUS20091900, SKNMC20006350, 
SMINT20006090, SMINT200081 1 0, SMINT20024140, SMINT20028840, SMINT20045470, 
SMINT20081330, SMINT20086250, SMINT20088440, SMINT20088690, SMINT20092160, 
SPLEN20017610, SPLEN2001 781 0, SPLEN200241 90, SPLEN20024620, SPLEN20054500, 
SPLEN20063890, SPLEN20073880, SPLEN20080070, SPLEN20090880, SPLEN201 01 950, 
SPLEN20105100, SPLEN20108000, SPLEN2011 01 80, SPLEN2011 0860, SPLEN2011 8050, 
SPLEN20125230, SPLEN20136700, SPLEN201 38600, SPLEN20139100, SPLEN201 75920, 
SPLEN201 82850, SPLEN201 83020, SPLEN201 83950, SPLEN201 90080, SPLEN201 90770, 
SPLEN201 93490, SPLEN201 93790, SPLEN20201 830, SPLEN20204670, TESOP1 0000350 
TESTI200061 60, TESTI200291 00, TESTI20031 31 0, TESTI20032770, 

TESTI20046540, TESTI20046870, 
TESTI20053260, TESTI20053780, 



TESTI20043220, TESTI20045390, 
TESTI20051200, TESTI20051 730, 



TESTI20059080, TESTI20061 200, TESTI200621 20, TESTI20063330, 
TESTI20066330, TESTI20068530, TESTI20070400, TESTI20070740, 



TESTI20095200, 
TESTI201 13940, 
TESTI201 49880, 
TESTI201 83680, 
TESTI20204260, 
TESTI20245860, 
TESTI20262150, 



TESTI20151800, 
TESTI201 84750, 
TESTI20210030, 
TESTI20246410, 



TESTI20170170, 



TESTI20038240, 
TESTI20047370, 
TESTI20057200, 
TESTI20063410, 
TESTI20073460, 
TESTI201 02390, 
TESTI20131440, 
TESTI201 73050, 



TESTI201 98540, TESTI20 1 991 1 0, 
TESTI2021 91 1 0, TESTI20244730, 
TESTI2025791 0, TESTI20260640, 



TESTI20095440, TESTI20095880, TESTI201 00090, 
TESTI201 1 61 20, TESTI201 2 1 040, TESTI201 21 71 0, 
TESTI201 62780, 
TESTI20186110, 
TESTI20214630, 
TESTI20251610, 

TESTI20262940, TESTI2026491 0, TESTI20271 790, TESTI20278280, 
TESTI20282900, TESTI20286590, THYMU20007020, THYMU20012020, THYMU2001 7270, 
THYMU20025480, THYMU200281 50, THYMU20030690, THYMU20034790, THYMU20046350, 
THYMU20050010, THYMU20052830, THYMU20054800, THYMU20055740, THYMU20055760, 
THYMU20078240, THYMU20079690, THYMU20083390, THYMU20087270, THYMU201 00940, 
THYMU201 37050, THYMU 20 137570, THYMU201 43230, THYMU20150190, THYMU2015321 0, 
THYMU201 63600, THYMU201 71 580, THY MU201 78440, THYMU201 85470, TRACH20011 01 0, 
TRACH20021380, TRACH20073990, TRACH20081270, TRACH20090060, TRACH201 49720, 
TRACH201 59390, TRACH201 63470, TRACH201 65330, TRACH201 67090, TRACH201 73680, 
UMVEN10001380, UTERU20035770, UTERU200401 50, UTERU20045200, UTERU20064120, 



FEBRA2000782.0, 
FEBRA20040230, 
FEBRA201 15930, 
FEBRA20191720, 
HHDPC20051850, 
HLUNG20029490, 
HLUNG20042730, 
HLUNG20081390, 
KIDNE20084040, 
KIDNE20142900, 
KIDNE201 82540, 
LIVER20010990, 
MESAN20021220, 
MESAN20084150, 
NT2NE20059210, 
NT2NE20140130, 
NT2RI20016240, 
NT2RP70031480, 
OCBBF20012520, 
OCBBF20120010, 
OCBBF201 76650, 
PLACE60004260, 
PLACE60061370, 
PLACE601 13340, 
PROST20014150, 
PROST20082430, 
PROST201 08850, 
PUAEN10000870, 
SKNSH20094350, 
SMINT20077960, 
SPLEN20015100, 
SPLEN20058180, 
SPLEN201 04690, 
SPLEN20121790, 
SPLEN201 77400, 
SPLEN201 93230, 
TESTI1 00001 90, 
TESTI20043130, 
TESTI20050400, 
TEST120057590, 
TESTI20063600, 
TESTI20086840, 
TESTI201 05910, 
TESTI20 142540, 
TESTI201 82760, 
TESTI20202830, 
TESTI20245600, 
TESTI20261040, 
TESTI20282420, 
THYMU20020800, 
THYMU20046770, 
THYMU20062770, 
THYMU201 15380, 
THY MU20 154790, 
TRACH20011540, 
TRACH201 49740, 
TRACH201 90460, 
UTERU20086530, 
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UTERU20087070, UTERU20087850, UTERU20089300, UTERU20089620, UTERU200951 00, LITE RU2009 9040, 
UTERU201 03200, UTERU20125810, UTERU201 27030, UTERU20127150, UTERU201 39760, UTERU201 88840 
[0241] The clones predicted to belong to the category of glycoprotein -related protein are the following 121 clones. 
ADRGL20020290, ADRGL20036840, ADRGL20059610, ADRGL20066770, ASTRO20055570, BNGH420046790, 

5 BNGH420077980, BRACE20051930, BRACE20069000, BRACE20204670, BRACE2021 6950, BRAMY2001 3670, 
BRAMY20089770, BRAMY20251210, BRAWH 2 0039640, BRCAN1 0000760, BRCAN20005230, BRCOC20003600, 
CD34C20001750, CTONG20017490, CTONG20036990, CTONG20045500, CTONG20059130, CTONG20079590, 
CTONG20085210, CTONG201 84830, DFNES2001 8000, DFNES20080880, FCBBF30004340, FCBBF30029250, 
FCBBF30062490, FCBBF30091520, FCBBF301 6451 0, FCBBF301 71 230, FCBBF301 95690, FCBBF3022321 0, 

10 FEBRA20038220, HCHON20015050, HLUNG20015070, HLUNG20032460, HLUNG200371 60, HLUNG20041540, 
KIDNE20142900, KIDNE201 691 80, KIDNE201 861 70, KIDNE201 89960, MESAN20021220, MESAN20058110, 
NT2NE20064780, NT2NE20140130, NT2NE20 155650, NT2RP70056690, NTONG20053630, OCBBF20000740, 
OCBBF20012520, OCBBF20110210, OCBBF201 20010, OCBBF201 65900, OCBBF201 65910, OCBBF201 91 950, 
PEBLM20001800, PLACE600004260, PLACE60087680, PLACE601 13340, PLACE601 84870, PROST20033240, 

15 PROST20099090, PROST20 108850, PROST201 46590, SKMUS20073150, SKNMC20006350, SMINT20028840, 
SMINT20056230, SM1NT20083290, SMINT20091190, SPLEN20024620, SPLEN20063890, SPLEN20080070, 
SPLEN20090880, SPLEN201 1 8050, SPLEN201391 00, SPLEN201 83020, SPLEN20201 830, TESTI1 0000190, 
TESTI20031 31 0, TESTI20043990, TESTI20045390, TESTI20051 200, TESTI20057590, TESTI20059080, 
TESTI20066330, TESTI20086840, TESTI201 00090, TESTI201 0591 0, TESTI201 54370, TESTI201 6421 0, 

20 TESTI201 82760, TESTI201 84750, TESTI20199110, TESTI20219110, TESTI20220230, TESTI20245600, 
TESTI20251610, TESTI2025791 0, TESTI20286590, THYMU20024500, THYMU200281 50, THYMU20052830, 
THYMU20062770, THYMU20099060, THYMU201 70080, THYMU201 78440, TRACH20011 01 0, TRACH20011540, 
TRACH20021380, TRACH20 149740, TRACH201 70860, TRACH201 90460, UTERU20086530, UTERU20087070, 
UTERU20127030 

25 [0242] The clones predicted to belong to the category of signal transduction-related protein are the following 88 
clones. 

ASTRO20050810, ASTRO20052420, ASTRO20085080, ASTRO20090680, BNGH420008150, BNGH42001 5760, 
BNGH420035290, BNGH420086030, BRAMY20035830, BRAMY20043630, BRAMY201 18490, BRAMY20206340, 
BRAMY20244490, BRAMY20251210, BRAMY20263000, BRAWH20093040, BRAWH20 190550, CTONG20004520, 

30 CTONG20029030, CTONG20030280, CTONG20063930, CTONG20070720, CTONG201 89000, FCBBF30001100, 
FCBBF30076310, FCBBF301 00080, FCBBF30 143550, FCBBF301531 70, FCBBF30 175350, FCBBF30250980, 
FEBRA20090160, FEBRA201 73330, HCHON20000870, HLUNG20011260, HLUNG20084790, KIDNE20089870, 
KIDNE201 60360, LIVER20011 640, MESAN20021130, MESAN20 027240, MESAN20065990, NT2NE2001 8890, 
NT2NE20042550, NT2RP70075800, NTONG20043080, NTONG20048440, PLACE60071 800, PROST20033240, 

35 PROST20052850, PROST20065790, PROST20075280, SKNSH20052400, SKNSH20057920, SMINT20006020, 
SMINT20035050, SPLEN20023540, SPLEN20039180, SPLEN20048800, SPLEN20049840, SPLEN20054160, 
SPLEN20085910, SPLEN201 91 020, SPLEN201 98390, TESTI20046490, TESTI20049060, TESTI20053070, 
TESTI20066650, TESTI20081 890, TESTI20095770, TESTI201 06820, TESTI201 1 2860, TESTI201 45780, 
TESTI201 50420, TESTI201 68880, TESTI20205250, TESTI202281 20, TESTI20244220, TESTI20244460, 

40 TESTI20251740, TESTI20261160, TESTI20264530, THYMU2001 3250, THYMU20039320, THYMU201 06990, 
THYMU201 45990, THYMU201 70080, THYMU201 7601 0, TRACH201 88350 

[0243] The clones predicted to belong to the category of transcription-related protein are the following 143 clones. 
ASTRO20038400, ASTRO20075150, BNGH420070370, BNGH420074600, BNGH420087430, BRACE2000331 0, 
BRACE20061620, BRAMY20001510, BRAMY20040580, BRAMY20076100, BRAMY20111 780, BRAWH20040680, 

45 BRAWH20050740, BRAWH20080580, BRAWH20082920, BRAWH20095900, BRSSN20066440, CTONG20020950, 
CTONG20044230, CTONG20053990, CTONG20072930, CTONG 20074000, CTONG20084660, CTONG20 186370, 
CTONG20 186520, DFNES200281 70, DFNES20046840, DFNES20073320, FCBBF3000361 0, FCBBF3001 9140, 
FCBBF30021900, FCBBF30093170, FCBBF301 14850, FCBBF301 29010, FCBBF301 36230, FCBBF301 43550, 
FCBBF30220050, FCBBF30228940, FCBBF30263080, FCBBF30285930, FCBBF500 03530, FEBRA20026820, 

50 FEBRA20027070, FEBRA20046510, FEBRA20057010, FEBRA20063720, FEBRA201 70240, HCHON1 0000150, 
HCHON20002650, HEART2001 931 0, HLUNG2001 4590, HLUNG2002811O, HLUNG20063700, KIDNE201 40870, 
LIVER20006260, MESAN20016270, MESAN20038520, NT2NE20038870, NT2NE20053950, NT2NE20060750, 
NT2NE20061030, NT2NE20079670, NT2NE20082600, NT2RP70001120, NT2RP70029780, NT2RP7004641 0, 
NT2RP70057500, NT2RP70075300, NT2RP70090870, OCBBF201 16250, OCBBF201 20950, OCBBF2012191 0, 

55 OCBBF201 56450, OCBBF201 57970, OCBBF201 66900, OCBBF201 75360, OCBBF201 77540, PEBLM20003260, 
PLACE60052940, PLACE60066970, PLACE601 22970, PLACE6015051 0, PLACE601 77880, PROST200071 70, 
PROST20024250, PROST200351 70, PROST20127450, PROST201 51370, PROST20 155370, PUAEN1 0000650, 
PUAEN20003120, SMINT20011 950, SMINT20026200, SMINT20030740, SMINT20039050, SMINT20044140, 



104 



EP 1 308 459 A2 



SMINT20086720, SPLEN20042200, SPLEN20043680, SPLEN20055600, S PL EN 20059270, SPLEN20063250, 
SPLEN20098030, SPLEN201 97930, TESTI1 0001 570, TESTI20057430, TESTI20057840, TESTI2005981 0, 
TESTI20067480, TESTI20068790, TESTI20075240, TESTI20079220, TESTI20088840, TESTI201 04090, 
TESTI201 22070, TESTI201 66670, TESTI201 71 070, TESTI201 73960, TESTI201 84760, TESTI201 94880, 

5 TESTI201 97600, TESTI20228740, TESTI20254030, TESTI20254990, TESTI20266050, TESTI20274960, 
TESTI20282530, THYMU 10004280, THYMU20019260, THYMU20032820, THYMU20071120, THYMU20077250, 
TKIDN1 0001 920, UTERU20016580, UTERU20026620, UTERU20041 630, UTERU20094830, UTERU20099510, 
UTERU20101150, UTERU201 69020, UTERU201 77150, UTERU201 85220, UTERU20188670 
[0244] The clones predicted to belong to the category of disease-related protein are the following 331 clones. 

10 ADRGL20020290, ADRGL20021910, ADRGL20026790, ADRGL20036840, ADRGL20059610, ADRGL20066770, 
ASTRO20038400, ASTRO20052420, ASTRO20055570, ASTRO20075150, ASTRO20088950, BNGH420008150, 
BNGH420086030, BRACE1 000051 0, BRACE20003310, BRACE200 69000, BRACE20097540, BRACE201 94670, 
BRACE20196180, BRACE20204670, BRACE20216950, BRAMY20003540, BRAMY20005080, BRAMY20035830, 
BRAMY20040580, BRAMY20043630, BRAMY20044920, BRAMY20051820, BRAMY20056620, BRAMY20089770, 

15 BRAMY20111780, BRAMY201 5251 0, BRAMY201 90550, BRAMY20221 600, BRAMY20227860, BRAMY20274510, 
BRAWH20082920, BRAWH20093040, BRAWH20095900, BRAWH201 90530, BRAWH20191980, BRCAN 10000760, 
BRCAN1 0001 050, BRCAN20005230, BRSSN20066440, CTONG20004520, CTONG20029030, CTONG20042640, 
CTONG20045500, CTONG20052780, CTONG20053990, CTONG20070780, CTONG20070910, CTONG200 72930, 
CTONG20083980, CTONG20084660, CTONG201 65750, CTONG20 169040, CTONG201 83430, CTONG201 83830, 

20 CTONG201 86290, CTONG201 89000, DFNES2001 6470, DFNES20025500, DFNES20046840, DFNES20055400, 
DFNES20080880, FCBBF1 0000230, FCBBF20035490, FCBBF20066340, FCBBF30002270, FCBBF30002280, 
FCBBF30019140, FCBBF30053300, FCBBF30071500, FCBBF30072440, FCBBF30076310, FCBBF30080730, 
FCBBF301 00080, FCBBF301 15920, FCBBF301 18670, FCBBF301 29010, FCBBF30 132050, FCBBF301 36230, 
FCBBF30153170, FCBBF301 6451 0, FCBBF301 66220, FCBBF301 71 230, FCBBF301 75350, FCBBF301 94550, 

25 FCBBF30220050, FCBBF3022321 0, FCBBF30259050, FCBBF30263080, FCBBF30275590, FCBBF50001 650, 
FEBRA20027070, FEBRA20045380, FEBRA20046200, FEBRA2004651 0, FEBRA20057010, FEBRA20063720, 
FEBRA20078800, FEBRA20087550, FEBRA2008881 0, FEBRA20090160, FEBRA20092760, FEBRA20151750, 
FEBRA201 70240, FEBRA201 73330, FEBRA20191720, HCHON1 0000150, HCHON20015050, HEART20C09590, 
HEART20022200, HEART200631 00, HHDPC20081230, HLUNG20008460, HLUNG20014590, HLUNG20032460, 

30 HLUNG20063700, HLUNG20065990, HLUNG20069350, HLUNG20081530, HLUNG20082350, HLUNG20083330, 
HLUNG20085210, KIDNE200811 70, KIDNE20084040, KIDNE20088240, KiDNE20089870, KIDNE201 33460, 
KIDNE201 34890, K1DNE20141 700, KIDNE201 42900, KIDNE201 50730, KIDNE20 152440, KIDNE201 60360, 
KIDNE201 65390, KIDNE201 691 80, KIDNE201 73430, KIDNE201 89960, LIVER20026440, MESAN20006200, 
MESAN20021130, MESAN20033220, MESAN20056890, MESAN20057240, MESAN20065990, MESAN20067430, 

35 MESAN20069530, NESOP20004520, NT2NE2001 8890, NT2NE20026200, NT2NE20037050, NT2NE20053950, 
NT2NE20061030, NT2NE2011 1190, NT2NE201 17580, NT2NE20119980, NT2NE201 40130, NT2NE20141040, 
NT2RI20093010, NT2RP70003110, NT2RP70046410, NT2RP70075300, NTONG20032100, NTONG20034540, 
OCBBF20000740, OCBBF2001 2520, OCBBF20111600, OCBBF2012001 0, OCBBF201 56450, OCBBF201 57970, 
OCBBF20191950, PEBLM20001 800, PEBLM20003260, PLACE60004260, PLACE60012620, PLACE60054230, 

40 PLACE60054870, PLACE60062660, PLACE60087680, P L AC E60 184870, PROST200 15210, PROST20024250, 
PROST20036350, PROST20050390, PROST20058860, PROST20 063430, PROST20065790, PROST20084720, 
PROST20099090, PROST20 120070, PROST20 127450, PROST20 146590, PROST201 52510, PROST201 68600, 
PUAEN1 0000650, PUAEN20003120, SKMUS20008730, SKMUS20017400, SKMUS20040440, SKMUS20073590, 
SKMUS20079150, SKNSH2000971 0, SMINT20002320, SMINT20007470, SMINT2000811 0, SMINT20011950, 

45 SMINT20016150, SMINT20026200, SMINT20030740, SMINT20049920, SMINT20077960, SMINT20083290, 
SMINT20086250, SMINT20089600, SMINT200911 90, SPLEN20023540, SPLEN200241 90, SPLEN20042200, 
SPLEN20043680, SPLEN20055600, SPLEN20057830, SPLEN20059270, SPLEN20063890, SPLEN20073500, 
SPLEN20080070, SPLEN2008591 0, SPLEN20090880, SPLEN20098030, SPLEN201 1 8050, SPLEN201 36730, 
SPLEN201 38600, SPLEN201391 00, SPLEN201 39360, SPLEN201 80980, SPLEN201 87490, SPLEN201 93790, 

50 SPLEN20201830, TESTI1 00001 90, TESTI2003131 0, TESTI20035790, TESTI20041630, TESTI20049060, 
TESTI20050720, TESTI20051 200, TESTI20057430, TESTI20057590, TESTI20059080, TESTI200621 20, 
TESTI20067480, TESTI20071 630, TESTI20099350, TESTI201 051 30, TESTI201 0591 0, TESTI201 08060, 
TESTI201 25920, TESTI201 30530, TESTI201 31 440, TESTI201 34680, TESTI20 142540, TESTI201 431 80, 
TESTI201 50420, TESTI201 54370, TESTI201 6421 0, TESTI201 66670, TESTI201 68880, TESTI201 71 070, 

55 TESTI201 82760, TESTI201 84750, TESTI201 93080, TESTI201 94880, TESTI201 96970, TESTI201 97600, 
TESTI20201760, TESTI20207170, TESTI2021 9110, TESTI20228740, TESTI20244430, TESTI20246480, 
TESTI20251 740, TESTI20252690, TESTI20254030, TESTI2025791 0, TESTI20258720, TESTI20266050, 
TESTI20271 790, TESTI20274960, TESTI20282530, TESTI20286590, THYMU 1 0004280, THYMU20006020, 
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THYMU20013250, THYMU20019260, THYMU20023560, THYMU20028150, THYMU20032820, THYMU20034400, 
THYMU20055460, THYMU20063650, THYMU20070250, THYMU20071120, THYMU20081110, THYMU20090230, 
THYMU20095920, THYMU20098350, THYMU20099060, THYMU201 20730, THYMU20121040, THYMU201 70080, 
THYMU201 85650, THYMU20191970, TKIDN1 0000620, TKIDN1 0001 920, TRACH20011540, TRACH20091 070, 
5 TRACH201 43710, TRACH201 70860, UTERU1 0001 060, UTERU20026620, UTERU20041 630, UTERU20086530, 
UTERU20087070, UTERU20087850, UTERU20099510, UTERU20101150, UTERU201 04310, UTERU201 27030, 
UTERU20 185220 

[0245] In particular, hit data of the following 328 clones for Swiss-Prot, or GenBank, UniGene, or nr corresponded 
to genes or proteins which had been deposited in the Online Mendelian Inheritance in Man (OMIM), which is the human 

10 gene and disease database, (the OMIM Number is shown in the parenthesis after the Clone Name). 

ADRGL20020290 (602193), ADRGL20021 91 0 (605717), ADRGL20026790 (605046), ADRGL20036840 (142800), 
ADRGL20059610 (230800;230900;231000;231005), ADRGL20066770 (130660), ASTRO20038400 (604764), 
ASTRO20052420 (600888), ASTRO20055570 (1 76640; 1 23400; 137440;245300;600072), ASTRO20075150 
(601896), ASTRO20088950 (603202;223000;223100), BNGH420008150 (600050), BNGH420086030 (118423), 

15 BRACE1 0000510 (148021), BRACE20003310 (603899), BRACE20069000 (204200), BRACE20097540 (604908), 
BRACE20194670 (314375), BRACE20196180 (605535), BRACE20204670 (176884), 

BRACE20216950 (158070), BRAMY20003540 (602142), BRAMY20005080 (604735), BRAMY20035830 (603524), 
BRAMY20040580 (604077), BRAMY20043630 (602775), BRAMY20044920 (603486), BRAMY20051 820 (604567), 
BRAMY20056620 (210210), BRAMY20089770 (602566), BRAMY20111 780 (604077), BRAMY2015251 0 (176879), 

20 BRAMY20190550 (600051), BRAMY20221600 (605789), BRAMY20227860 (605416), BRAMY2027451 0 (180475), 
BRAWH20082920 (603246), BRAWH20093040 (602989), BRAWH20095900 (602277), BRAWH201 90530 (605208), 
BRAWH20191980 (239500), BRCAN1 0000760 (111000), BRCAN 10001 050 (603696), BRCAN20005230 (603268), 
BRSSN20066440 (603430), CTONG20004520 (603817), CTONG20029030 (602775), CTONG20042640 (103390), 
CTONG20045500 (106195), CTONG20052780 (605612), CTONG20053990 (602187), CTONG20070780 (118990), 

25 CTONG20070910 (604450), CTONG20072930 (314995), CTONG20083980 (601703), CTONG20084660 (600834), 
CTONG20165750 (182465), CTONG20169040 (148030), CTONG20183430 (106410), CTONG20183830 (600382), 
CTONG20186290 (100660), CTONG201 89000 (600888), DFNES2001 6470 (605952), DFNES20025500 (604581), 
DFNES20046840 (60261 7;241 850), DFNES20055400 (603456), DFNES20080880 (602273), FCBBF1 0000230 
(602327), FCBBF20035490 (602489), FCBBF20066340 (603560), FCBBF30002270 (142708), FCBBF30002280 

30 (176763), FCBBF30019140 (602120), FCBBF30053300 (600299), FCBBF30071500 (125485), FCBBF30072440 
(604455), FCBBF30076310 (176892), FCBBF30080730 (600572), FCBBF30100080 (602488), FCBBF30115920 
(603577), FCBBF30118670 (603640), FCBBF3012901 0 (601260), FCBBF30132050 (603018), FCBBF301 36230 
(189909), FCBBF30153170 (171860;171850), FCBBF301 6451 0 (603006), FCBBF30166220 (182144), 
FCBBF30171230 (162151), FCBBF301 75350 (602521), FCBBF301 94550 (182900), FCBBF30220050 (600380), 

35 FCBBF30223210 (300022), FCBBF30263080 (194558), FCBBF30275590 (601403), FCBBF50001 650 (605268), 
FEBRA20027070 (314995), FEBRA20045380 (602942), FEBRA20046200 (106410), FEBRA2004651 0 (604077), 
FEBRA20057010 (602187), FEBRA20063720 (603899), FEBRA20078800 (601 825;256000), FEBRA20087550 
(600811), FEBRA20088810 (603725), FEBRA200901 60 (600137), FEBRA20092760 (602567), FEBRA201 70240 
(314997), FEBRA20173330 (602990), FEBRA201 91 720 (603895), HCHON1 0000150 (300163), HCHON20015050 

40 (151510), HEART20009590 (604581), HEART20022200 (601870), HEART200631 00 (602422), HHDPC20081230 
(164035), HLUNG20008460 (300108), HLUNG20014590 (604077), HLUNG20032460 (176785), HLUNG20063700 
(600210), HLUNG20065990 (186591), 

HLUNG20069350 (114212), HLUNG20081 530 (162230), HLUNG20082350 (604677), HLUNG20083330 (120180), 
HLUNG20085210 (604464), KIDNE20081170 (604535), KIDNE20084040 (602382), KIDNE20088240 (605084), 

45 KIDNE20089870 (602922), KIDNE20133460 (605430), KIDNE20134890 (117143), KIDNE20141700 (312760), 
KIDNE20142900 (188040), KIDNE20150730 (179715), KIDNE20152440 (602194), KIDNE20160360 (602488), 
KIDNE201 65390 (604649), KIDNE20169180 (191845), KIDNE201 73430 (603831), KIDNE201 89960 (275360), 
LIVER20026440 (601270), MESAN20006200 (151740), MESAN20021130 (600050), MESAN20033220 (600466), 
MESAN20056890 (600813), MESAN20057240 (126380), MESAN20065990 (601959), MESAN20067430 (191010), 

50 MESAN20069530 (604362), NESOP20004520 (153432), NT2NE2001 8890 (606031), NT2NE20026200 (277730), 
NT2NE20037050 (300028), NT2NE20053950 (604078), NT2NE20061030 (600834), NT2NE20111190 (602619), 
NT2NE20117580 (601 825;256000), NT2NE20119980 (191161), NT2NE20140130 (601281), NT2NE20141040 
(602917), 

NT2RI20093010 (172460), NT2RP7000311 0 (1301 60; 194050), NT2RP7004641 0 (601930), NT2RP70075300 
55 (601856), NTONG20032100 (1 48065; 1 93900), NTONG20034540 (602658), OCBBF20000740 (602059), 
OCBBF20012520 (602059), OCBBF201 1 1 600 (147625), OCBBF201 2001 0 (605008), OCBBF20156450 (314997), 
OCBBF20157970 (604077), OCBBF201 91950 (192977), PEBLM20001800 (146900), PEBLM20003260 (194558), 
PLACE60004260 (601891), PLACE60012620 (214500), PLACE60054230 (300108), PLACE60054870 (160776), 
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PLACE60062660 (606004), 

PLACE60087680 (146732), PLACE601 84870 (172425), PROST20015210 (160745), PROST20024250 (604078), 
PROST20036350 (138295), PROST20050390 (601258), PROST20058860 (182282), PROST20063430 (603292), 
PROST20065790 (171840), PROST20084720 (604426), PROST20099090 (602714), PROST20120070 (602809), 

5 PROST20127450 (602960), PROST201 46590 (158340,113720), PROST2015251 0 (603367), PROST201 68600 
(604415), PUAEN 10000650 (602960), PUAEN200031 20 (601573), SKMUS20008730 (602127), SKMUS20017400 
(191030), SKMUS20040440 (604163), SKMUS20073590 (605834), SKMUS20079150 (605596), SKNSH20009710 
(191030;164970), SMINT20002320 (601644), SMINT20007470 (190370), SMINT20008110 (604384), 
SMINT20011950 (603430) : SMINT2001 61 50 (134790), SMINT20026200 (159556), SMINT20030740 (604078), 

10 SMINT20049920 (600417), SMINT20077960 (1 37350; 1 05120), SMINT20083290 (146900), SMINT20086250 
(238330), SMINT20089600 (605926), SMINT20091190 (146900), SPLEN20023540 (605577), SPLEN20024190 
(601548), SPLEN20042200 (604167), SPLEN20043680 (126340;234050;278730), SPLEN20055600 (194541), 
SPLEN20057830 (179715), SPLEN20059270 (602165), SPLEN20063890 (600245), SPLEN20073500 (603300), 
SPLEN20080070 (230000), SPLEN20085910 (603424), SPLEN20090880 (142800), SPLEN20098030 (601742), 

15 SPLEN20118050 (301870), SPLEN20136730 (605412), SPLEN20138600 (603728), SPLEN20139100 (147120), 
SPLEN20139360 (117140), SPLEN201 80980 (156560), SPLEN201 87490 (179838), SPLEN201 93790 (147150), 
SPLEN20201830 (301870), TESTI1 00001 90 (158340;1 13720), 

TESTI20031310 (107280), TESTI20035790 (601940), TESTI20049060 (603889), TESTI20050720 (245050), 
TESTI20051200 (602273), TESTI20057430 (194532), TESTI20057590 (601890), TESTI20059080 (604038), 

20 TESTI20062120 (604212), TESTI20067480 (602277), TESTI20071 630 (602692), TESTI20099350 (160776), 
TESTI20105130 (310400), TESTI201 0591 0 (601328), TESTI201 08060 (600590), TESTI20125920 (601934), 
TESTI20130530 (146680), TESTI20131440 (114850), TESTI20134680 (117143), TESTI20142540 (137960), 
TEST120143180 (117143), TESTI20150420 (602732), TESTI20154370 (600936), TESTI20164210 (602319), 
TESTI20166670 (142968), TESTI201 68880 (151410), TESTI201 71 070 (604064), TESTI201 82760 (601328), 

25 TESTI20184750 (150320), TESTI201 93080 (602128), TESTI201 94880 (602260), TESTI201 96970 (601117), 
TESTI20197600 (604167), TESTI20201 760 (602162), TESTI202071 70 (480100), TESTI2021 911 0 (601890), 
TESTI20228740 (604027), TESTI20244430 (182900), TESTI20246480 (601486), TESTI20251740 (602731), 
TESTI20252690 (601368), TESTI20254030 (602330), TESTI2025791 0 (142871), TESTI20258720 (182900), 
TESTI20266050 (109092), TESTI20271 790 (604678), TESTI20274960 (194558), TESTI20282530 (604077), 

30 TESTI20286590 (147267), THYMU10004280 (602290), THYMU20006020 (601149), THYMU20013250 (601988), 
THYMU20019260 (603899), THYMU20023560 (142765), THYMU20028150 (190197), THYMU20032820 (604077), 
THYMU20034400 (604449), THYMU20055460 (133280), THYMU20063650 (180480), THYMU20070250 (277730), 
THYMU20071120 (603899), THYMU20081110 (602567), THYMU20090230 (602324), THYMU20095920 (605349), 
THYMU20098350 (148040;131 760;1 31 800;131 900), THYMU20099060 (146900), THYMU20 120730 (100660), 

35 THYMU20121040 (130592), THYMU201 70080 (604964), THYMU201 85650 (602121 ;1 24900), THYMU201 91 970 
(604265), TKIDN10000620 (605072), TKIDN1 0001 920 (603899), TRACH20011540 (191155), TRACH20091070 
(100660), TRACH20143710 (601905), TRACH20170860 (147170), UTERU10001060 (311040), UTERU20026620 
(314997), UTERU20041630 (602277), 

UTERU20086530 (173310), UTERU20087070 (216950), UTERU20087850 (605248), UTERU2009951 0 (604077), 
40 UTERU20101150 (164012), UTERU20104310 (604414), UTERU20127030 (150325), UTERU201 85220 (600014) 

[0246] The clones predicted to belong to the category of enzyme and/or metabolism-related protein are the following 
219 clones. 

ADRGL20059610, ASTRO20026320, ASTRO2005081 0, ASTRO20088950, BNGH4200081 50, BNGH420035290, 
BNGH420074600, BRACE20050870, BRACE20097540, BRACE20200770, BRACE20204670, BRACE20215410, 

45 BRAMY20003540, BRAMY20005080, BRAMY20027990, BRAMY20028620, BRAMY20044920, BRAMY20055760, 
BRAMY20056620, BRAMY20072870, BRAMY20093490, BRAMY20096930, BRAMY20118490, BRAMY201 25360, 
BRAMY201 43870, BRAMY2015251 0, BRAMY20231150, BRAMY20244490, BRAMY20251 21 0, BRAWH20021 91 0, 
BRAWH20082920, BRAWH20093040, BRAWH20094900, BRAWH201 831 70, BRAWH201 88750, BRAWH201 90550, 
BRAWH20191980, BRCAN20005230, BRCOC20003600, CTONG20051100, CTONG2007091 0, CTONG2007681 0, 

50 CTONG20079590, CTONG20080140, CTONG20085210, CTONG201 86290, DFNES20063460, DFNES20080880, 
FCBBF20023490, FCBBF20066340, FCBBF30004340, FCBBF30019140, FCBBF30022680, FCBBF30029250, 
FCBBF30072440, FCBBF3007631 0, FCBBF30085560, FCBBF30091520, FCBBF30 107290, FCBBF301 25880, 
FCBBF301 32050, FCBBF301 43550, FCBBF301531 70, FCBBF301 66220, FCBBF301 71230, FCBBF301 75350, 
FCBBF30236670, FCBBF30260480, FEBRA20038220, FEBRA20040560, FEBRA20078800, FEBRA200901 60, 

55 FEBRA201 72230, FEBRA201 73330, HCHON20000870, HCHON20002710, HEART10001490, HEART20022200, 
HEART20047640, HEART20082570, HLUNG20011 260, HLUNG20032460, HLUNG20041540, HLUNG20042730, 
HLUNG20054790, KIDNE20080690, KIDNE20083620, KIDNE20084040, KIDNE201471 70, KIDNE201 52440, 
KIDNE201 731 50, KIDNE201 861 70, KIDNE201 89960, LIVER^001 1 640, LI VER20026440, LIVER20055270, 
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MESAN20021130, MES AN 20 033220, MESAN20038520, ME5AN20057240, MESAN20058110, MESAN20065990, 
MESAN20095800, NT2NE20026200 : NT2NE20042550, NT2NE20117580, NT2NE201 27900, NT2RI20093010, 
NT2RP70064570, NTONG2 003 4540, NTONG20043080, NTONG20053630, NTONG20053730, NTONG20058010, 
OCBBF20120010, OCBBF201 67290, OCBBF201 91 950, PANCR1 0000860, PLACE60052940, PLACE60064180, 

5 PLACE60073090, PLACE60095600, PLACE601 84410, PLACE601 88630, PROST20007600, PROST20033240, 
PROST20036350, PROST20039300, PROST20050390, PROST20051310, PROST20052850, PROST20065790, 
PROST20075280, PROST20084720, PROST20099090, PROST201 08850, PROST20152510, PUAEN20001520, 
PUAEN20002470, SKNMC20006350, SKNSH20057920, SMINT20008110, SMINT20049920, SMINT20094680, 
SPLEN20023540, SPLEN20024930, SPLEN20043680, SPLEN20048800, SPLEN20054500, SPLEN20057900, 

10 SPLEN20071820, SPLEN20080070, SPLEN2008591 0, SPLEN201 08000, SPLEN201 36730, SPLEN201 80980, 
TESTI2001 2080, TESTI20030200, TESTI20031 31 0, TEST120038240, TESTI20050720, TESTI20051 200, 
TESTI20059080, TESTI200621 20, TESTI20066330, TESTI20076570, TESTI201 03690, TESTI201 051 30, 
TESTI201 06820, TESTI201 08060, TESTI201 1 2860, TESTI201 21 040, TESTI201 30530, TESTI201 31 440, 
TESTI201 68880, TESTI20 1 701 70, TESTI201 96690, TESTI201 96970, TESTI201 991 1 0, TESTI20205250, 

15 TESTI20212970, TESTI20222030, TESTI20226520, TESTI20227380, TESTI20244460, TESTI20244730, 
TESTI20250630, TESTI20260640, TESTI20262940, TESTI20264530, TESTI20285230, THYMU20006020, 
THYMU20013250, THYMU20034400, THYMU20039320, THYMU20055460, THYMU20055760, THYMU20063650, 
THYMU20066660, THYMU20070250, THYMU20087270, THYMU20096580, THYMU201 00940, THYMU20110720, 
THYMU201 20240, THYMU201 20730, THYMU201 70230, TRACH20011010, TRACH20021380, TRACH20091 070, 

20 TRACH201 13020, TRACH20143710, TRACH201 64100, TRACH201 90460, UTERU20087070, UTERU20089620, 
UTERU201 04310, UTERU201 85220, UTERU201 88670 

[0247] The clones predicted to belong to the category of cell division and/or cell proliferation-related protein are the 
following 44 clones. 

ASTRO20090680, BRACE20079370, BRAMY20234820, BRCAN1 0001 050, BRCAN2000541 0, CTONG20032930, 
25 FCBBF20070950, FCBBF30002270, FCBBF30053300, FCBBF301 05860, FCBBF301 75350, FCBBF30215240, 
FCBBF30275590, FEBRA20045380, HLUNG20068120, KIDNE20134890, KIDNE20150730, MESAN20021 470, 
NT2NE20077250, NT2NE201 53620, NT2RP70030840, NTONG2005391 0, OCBBF2011 1370, OCBBF201 74580, 
PROST20063430, SKNMC10001230, SMINT20028800, SPLEN20023540, SPLEN20057830, SPLEN201 39360, 
TESTI20031 41 0, TESTI20057840, TESTI20065650, TESTI20066650, TESTI201 07320, TESTI201 08060, 
30 TESTI201 14480, TESTI201 34680, TESTI201431 80, TESTI201 50920, TESTI20201760, TESTI20278280, 
TESTI20284260, THYMU 20097920 

[0248] The clones predicted to belong to the category of cytoskeleton-related protein are the following 80 clones. 
ADRGL20062330, ASTRO20053430, BGGI1 20000670, BRACE20079370, BRAMY20038980, BRAMY20083330, 
BRAMY20094890, CTONG2000411 0, CTONG20032930, CTONG20077760, CTONG20083980, CTONG201 69040, 

35 CTONG201 83430, DFNES20018000, FCBBF301 05860, FCBBF3013041 0, FCBBF301 94550, FCBBF30201 630, 
FCBBF30271 990, FEBRA20005040, FEBRA20046200, FEBRA20099860, HCHON20015050, HLUNG20081530, 
KIDNE20081170, NT2RP70001 730, NT2RP70003110, NTONG200321 00, OCBBF201 66890, OCBBF201 74890, 
PLACE60054870, PLACE60055590, PLACE60071800, PLACE6011881 0, PROST200 15210, PROST20097840, 
PROST201 20070, PROST201 46590, SKMUS20007260, SKMUS20008730, SKMUS2001 7400, SKMUS20073590, 

40 SMINT20062050, SMINT20074330, SMINT20077960, SPLEN200391 80, SPLEN20049840, SPLEN20076470, 
SPLEN201 82990, SPLEN201 87490, SPLEN201 9571 0, TESTI1 00001 90, TESTI20041 630, TESTI20057880, 
TESTI20058920, TESTI20060080, TESTI20064530, TESTI20064650, TESTI20065650, TESTI20067440, 
TESTI20071 1 30, TESTI20099350, TESTI201 1 2540, TESTI201 25280, TESTI201 3601 0, TESTI201 5331 0, 
TESTI201 75370, TESTI20222460, TESTI20244430, TESTI20254030, TESTI20258720, THYMU20024500, 

45 THYMU20062610, THYMU20098350, TRACH20043360, TRACH20098510, TRACH20 149500, UTERU20089390, 
UTERU201 22520, UTERU201 68960 

[0249] The clones predicted to belong to the category of nuclear protein and/or RNA synthesis-related protein are 
the following 70 clones. 

ASTRO20026320, BRACE20050870, BRACE20200770, BRAMY201 34050, BRAWH20063010, BRAWH20093040, 
50 BRAWH201 74330, BRAWH201 76850, CTONG20042640, FCBBF20023490, FCBBF20035490, FCBBF20070950, 
FCBBF30002270, FCBBF30048420, FCBBF30080730, FCBBF3011 5920, FCBBF30236670, FEBRA20035240, 
FEBRA20092760, FEBRA201 73330, HHDPC20081230, HLUNG20011460, HLUNG20068120, KIDNE20089870, 
KIDNE20150730, MESAN20056890, MESAN20057240, NT2NE20037050, NT2NE201 67660, NT2RP70031070, 
NTONG20053730, PLACE600641 80, PLACE60095600, PROST2001 6760, PROST2005131 0, PROST20058860, 
55 PROST201 52510, PUAEN20002470, SKMUS20079150, SKNSH20030640, SPLEN20023850, SPLEN20057830, 
SPLEN201 39360, SPLEN201 90430, TESTI20006830, TESTI20030200, TESTI20031 41 0, TESTI20035790, 
TESTI200621 20, TESTI20065650, TESTI20081 890, TESTI201 50920, TESTI201 5331 0, TESTI20201 760, 
TESTI2021 2970, TESTI20227380, TESTI20251 740, TESTI20256560, TESTI20260640, TESTI202701 30, 
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TESTI20284260, TESTI20285230, THYMU20021090, THYMU20049060, THYMU20066660, THYMU20081110, 
THYMU20090230, THYMU201 20240, UTERU1 0001 060, UTERU20104310 

[0250] The clones predicted to belong to the category of protein synthesis and/or trans port- related protein are the 
following 20 clones. 

5 BRAMY20038980, BRAMY2027451 0, CTONG20008190, CTONG20033610, FCBBF2001 8680, FEBRA20090220, 
KIDNE20141700, NT2NE201 67660, NTONG20055200, PLACE600 12620, PROST20036350, PROST20062820, 
SKMUS20040440, SMINT20000070, SPLEN201 80980, TESTI20055680, TESTI20067440, TESTI201 07240, 
THYMU20096580, THYMU20121040 

[0251] The clones predicted to belong to the category of cellular defense-related protein are the following 1 0 clones. 
10 ASTRO20089600, BRAMY2011 7670, FEBRA20087550, HLUNG20081 390, MESAN20057240, NTONG20031580, 
PROST20007600, SPLEN20023850, SPLEN20043680, TESTI20261680 

[0252] The clones predicted to belong to the category of development and/or differentiation-related protein are the 
following 19 clones. 

BRACE20061620, BRACE20200770, BRAMY20013670, CTONG2001 7490, CTONG20020950, HCHON1 00001 50, 
15 MESAN20021470, OCBBF201 65910, PROST201 55370, PUAEN20002470, TESTI20079220, TESTI20079980, 
TESTI201 66670, TESTI201 84760, TESTI20252690, TRACH20040390, UTERU20089620, UTERU20094830, 
UTERU201 69020 

[0253] The clones predicted to belong to the category of DNA-binding and/or RNA-binding protein are the following 
1 68 clones. 

20 ASTRO20038400, BGG1 12001 0750, BNGH420070370, BRACE2000331 0, BRACE20061 620, BRAMY20001510, 
BRAMY20040580, BRAMY20076100, BRAMY20111780, BRAMY2027451 0, BRAWH20040680, BRAWH20050740, 
BRAWH20063010, BRAWH20080580, BRAWH20095900, BRAWH201 74330, BRSSN20066440, CTONG20020950, 
CTONG20044230, CTONG20053990, CTONG20072930, CTONG20074000, CTONG201 65750, CTONG201 86370, 
CTONG201 86520, DFNES20046840, DFNES20073320, FCBBF20 035430, FCBBF20070950, FCBBF30002270, 

25 FCBBF30003610, FCBBF30019140, FCBBF30021 900, FCBBF30048420, FCBBF30080730, FCBBF300931 70, 
FCBBF301 14850, FCBBF3012901 0, FCBBF301 36230, FCBBF30220050, FCBBF30228940, FCBBF30236670, 
FCBBF30263080, FCBBF30285930, FCBBF50003530, FEBRA20026820, FEBRA20027070, FEBRA20035240, 
FEBRA20046510, FEBRA2005701 0, FEBRA20063720, FEBRA20087550, FEBRA20092760, FE BR A201 70240, 
FEBRA201 77800, HCHON20002650, HEART20019310, HEART20063100, HHDPC20081230, HLUNG20011460, 

30 HLUNG20014590, HLUNG20028110, HLUNG20063700, HLUNG20068120, KIDNE201 40870, LIVER20006260, 
MESAN20016270, MESAN20056890, MESAN20057240, NT2NE20038870, NT2NE20053950, NT2NE20060750, 
NT2NE20079670, NT2NE20082600, NT2NE20087270, NT2RP70029780, NT2RP70046410, NT2RP70057500, 
NT2RP70075300, NT2RP70090870, OCBBF201 16250, OCBBF201 20950, OCBBF20121910, OCBBF201 56450, 
OCBBF201 57970, OCBBF201 66900, OCBBF201 75360, OCBBF201 77540, PEBLM1 0001 470, PEBLM20003260, 

35 PLACE60066970, PLACE601 22970, PLACE601 77880, PROST200071 70, PROST20024250, PROST200351 70, 
PROST20051310, PROST20058860, PROST20151370, PROST20 155370, PUAEN20003120, SMINT20011950, 
SMINT20030740, SMINT20039050, SMINT20044140, SMINT20086720, SPLEN20042200, SPLEN20043680, 
SPLEN20055600, SPLEN20059270, SPLEN20063250, SPLEN201 39360, SPLEN201 90430, TEST1 1000 1570, 
TESTI20006830, TESTI20030200, TESTI20031 41 0, TESTI20035790, TESTI20057430, TESTI2005981 0, 

40 TESTI20062120, TESTI20067480, TESTI20068790, TESTI20075240, TESTI20079220, TESTI20088840, 
TESTI201 04090, TESTI201 34970, TESTI201 66670, TESTI201 71 070, TESTI201 73960, TESTI201 84760, 
TESTI201 97600, TESTI20201 760, TESTI2021 2970, TESTI20227380, TESTI20228740, TESTI20246480, 
TESTI20254030, TESTI20254990, TESTI20266050, TESTI20268240, TESTI202701 30, TESTI20274960, 
TESTI20282530, TESTI20284260, TESTI20285230, THYMU 10004280, THYMU20019260, THYMU20023560, 

45 THYMU20032820, THYMU20049060, THYMU20066660, THYMU20071120, THYMU20077250, THYMU20081110, 
THYMU20090230, TKIDN1 0001920, TRACH20 108240, UTERU1 0001 060, UTERU20026620, UTERU20041630, 
UTERU20094830, UTERU2009951 0, UTERU201 01150, UTERU201 69020, UTERU20177150, UTERU201 88670 
[0254] The clones predicted to belong to the category of ATP binding and/or GTP-binding protein are the following 
93 clones. 

50 ASTRO20026320, BNGH420035290, BRACE20050870, BRACE20079370, BRACE20200770, BRAMY20055760, 
BRAMY201 18490, BRAMY20244490, BRAMY20251 21 0, BRAWH20093040, BRAWH201 90550, BRCAN1 0001050, 
BRCOC20003600, CTONG20008190, CTONG20030280, CTONG20032930, CTONG20 176040, CTONG201 84830, 
FCBBF20023490, FCBBF30019140, FCBBF30076310, FCBBF301 05860, FCBBF30 175350, FCBBF30201630, 
FCBBF30236670, FEBRA20005040, FEBRA20090160, FEBRA201 73330, HCHON20000870, HLUNG20011260, 

55 HLUNG20052300, KIDNE200811 70, KIDNE201 34890, LIVER20030650, LIVER20055270, MESAN20065990, 
NT2NE20042550, NTONG20043080, NTONG20055200, OCBBF201 82060, PLACE60054870, PLACE600641 80, 
PLACE60095600, PLACE 60 140640, PROST2001 521 0, PROST20033240, PROST20036350, PROST20051 31 0, 
PROST20052850, PROST20062820, PROST20075280, PROST201 20070, PUAEN20002470, SKNSH20052400, 
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SKNSH20057920, SMINT20008110, SPLEN20023850 t SPLEN20043680, SPLEN20049840, SPLEN201 36730, 
SPLEN201 80980, SPLEN201 93790, TESTI20055680, TESTI20058920, TESTI20060080, TESTI20064650, 
TESTI20071 1 30, TESTI20099350, TESTI201 06820, TESTI201 1 2860, TESTI201 34680, TESTI201 3601 0, 
TESTI201 431 80, TESTI201 75370, TESTI2021 2970, TESTI20222460, TESTI20227380, TESTI20244220, 
5 TESTI20244460, TESTI 2 0264530, THYMU20013250, THYMU20039320, THYMU20062610, THYMU20066660, 
THYMU20087270, THYMU20096580, THYMU201 00940, THYMU201 76010, TRACH20043360, TRACH20098510, 
TRACH201 13020, UTERU201 85220, UTERU201 88670 

[0255] Among the clones other than the ones shown above, BNGH42003641 0 and FCBBF30257370 are clones 
which were predicted to highly possibly belong to the category of secretory protein and/or membrane protein based 
10 on the result of domain search by Pfam. 
SMINT20044730, TESTI20140970 

[0256] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
glycoprotein -related protein based on the result of domain search by Pfam. 

BRACE20055560, CTONG20046690, DFNES2004371 0, FCBBF30005500, MESAN20030350, MESAN20030370, 
15 PLACE60074820, TESTI20058350, TESTI20106170, TRACH20131230, UTERU20000950 

[0257] The 11 clones shown above are clones which were predicted to highly possibly belong to the category of 
signal transduction-related protein based on the result of domain search by Pfam. 

ASTRO20010290, BRACE20099070, CTONG20007660, DFNES20076340, DFNES20094820, FCBBF301 25460, 
FCBBF301 42290, FCBBF301 69280, FEBRA20031 000, NT2NE2002651 0, NT2RP70031 340, PLACE50001390, 
20 SPLEN201 35030, TESTI20046890, TESTI20060350, TESTI20 166290, TESTI20259110, THYMU20 184550 

[0258] The 18 clones shown above are clones which were predicted to highly possibly belong to the category of 
transcription -related protein based on the result of domain search by Pfam. 

ADRGL20047770, ADRGL20079060, BRACE20014450, BRACE20051 600, BRAWH 201 85260, CTONG20033750, 
CTONG20070090, CTONG20 190290, FCBBF2 002 0440, FCBBF30005360, FCBBF301 73960, FEBRA20031000, 
25 KIDNE20087880, LIVER20013890, MESAN20030350, MESAN20030370, OCBBF201 131 1 0, PLACE60074820, 
PLACE60093380, PROST20028970, PROST201 02190, SALGL1 0001 070, SPLEN20006950, SPLEN20011350, 
SPLEN20050090, TESTI20060830, TESTI200661 50, TESTI201 20900, TESTI201 3231 0, TESTI201 48380, 
TESTI201 62980, TESTI201 66290, TESTI20205100, THYMU20112590, TRACH20029880 

[0259] The 35 clones shown above are clones which were predicted to highly possibly belong to the category of 
30 enzyme and/or metabolism-related protein based on the result of domain search by Pfam. 
PLACE60054820, TESTI201 97030 

[0260] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
cell division and/or cell proliferation- related protein based on the result of domain search by Ram. 
ASTRO20006530, OCBBF2001 6390, TRACH20058000 
35 [0261] The three clones shown above are clones which were predicted to highly possibly belong to the category of 
cytoskeleton- related protein based on the result of domain search by Pfam. 
BRACE20065470, PLACE60054820 

[0262] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
nuclear protein and/or RNA synthesis- related protein based on the result of domain search by Pfam. 
40 ASTRO20010290, BRACE20099070, BRAWH200 14590, CTONG20007660, DFNES20076340, DFNES20094820, 
FCBBF301 25460, FCBBF301 42290, FCBBF301 69280, FEBRA20031000, MESAN20034440, NT2NE20026510, 
NT2RP70031 340, PLACE50001 390, SPLEN201 35030, TESTI20046890, TESTI20060350, TESTI201 66290, 
TESTI20259110, THYMU20 104480, THYMU20 184550 

[0263] The 21 clones shown above are clones which were predicted to highly possibly belong to the category of 
45 DNA-binding and/or RNA-binding protein based on the result of domain search by Ram. 
KIDNE20133880, MESAN20030350, MESAN20030370, TESTI20059480 

[0264] The four clones shown above are clones which were predicted to highly possibly belong to the category of 
ATP-binding and/or GTP-binding protein based on the result of domain search by Pfam. 

[0265] The 205 clones shown below are clones which were unassignable to any of the above-mentioned categories, 
50 but have been predicted to have some functions based on homology search for their full-length nucleotide sequences 

and motif search in their deduced ORFs. Clone Name, Definition in the result of homology search or Motif Name in 

the motif search, demarcated by a double slash mark (//), are shown below. 

ADRGL20022600//DIAPHANOUS PROTEIN HOMOLOG 1 (P140MDIA). 

ADRGL20023920//ABC1 PROTEIN HOMOLOG PRECURSOR. 
55 ASTRO20001 91 0//Rattus norvegicus mRNA for annexin V-binding protein (ABP-1 0), partial cds. 

ASTRO20009140//PUTATIVE COMPETENCE-DAMAGE PROTEIN. 

ASTRO20046280//PSU1 PROTEIN. 

ASTRO20058960//DNA damage inducible protein homolog - fission yeast (Schizosaccharomyces pombe) 



110 



EP 1 308 459 A2 



BNGH420024870//C2 domain// C2 domain// C2 domain 

BRACE20007330//RING CANAL PROTEIN (KELCH PROTEIN). 

BRACE20052430//Homo sapiens AMSH mRNA, complete cds. 

BRACE20054600//Xenopus laevis mRNA for Kielin, complete cds. 
5 BRACE2005981 0//TSC-22/dip/bun family 

BRACE20063540//MEROZOITE SURFACE PROTEIN CMZ-8 (FRAGMENT). 

BRACE20079200//Xenopus laevis mRNA for Kielin, complete cds. 

BRAMY20016780//Proprotein convertase P-domain 

BRAMY20023640//UBX domain 
10 BRAMY20045420//Domain found in Dishevelled, Egl-10, and Pleckstrin 

BRAMY20056840//UBE-1 c2 

BRAMY20063750//Homo sapiens HRIHFB2007 mRNA, partial cds. 
BRAMY20102900//Homo sapiens RU1 (RU1) mRNA, complete cds. 
BRAMY201 58550//CALMODULIN. 
15 BRAMY20223010//Mus musculus leucine-rich glioma-inactivated 1 protein precursor, (Lgi1) mRNA, complete cds. 
B R AMY20238630//TETR ATR ICO P E PTI D E REPEAT PROTEIN 4. 
BRAMY20245760//Araneus diadematus fibroin-4 mRNA, partial cds. 
BRAWH20047790//HMG (high mobility group) box 

BRSSN2000561 0//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete 
20 cds. 

BRSSN20005660//Bacterial type I! secretion system protein 

BRSSN20093890//Homo sapiens mRNA for Kelch motif containing protein, complete cds. 
CTONG20041150//Streptomyces ansochromogenes strain 7100 SanE (sanE) gene, complete cds. 
CTONG20066110//Homo sapiens DEME-6 mRNA, partial cds. 
25 CTONG20069420//Ribosomal protein S1 4p/S29e 
CTONG2007I040//BETA CRYSTALLIN B2 (BP). 
CTONG20074170//DENN (AEX-3) domain 
CTONG20083430//Nuclear transition protein 2 

CTONG20170940//MYOTROPHIN (V-1 PROTEIN) (GRANULE CELL DIFFERENTIATION PROTEIN). 
30 CTONG20174290//TRICHOHYALIN. 

CTONG20174580//Homo sapiens mRNA for vascular Rab-GAP/TBC -containing protein complete cds. 

CTONG20180690//Collagen triple helix repeat (20 copies) 

CTONG20186550//cca3 protein - rat 

CTONG20188080/VTPR Domain 
35 FCBBF1 0004760//Homo sapiens GAP-like protein (N61 ) mRNA, complete cds. 

FCBBF20033360//RING CANAL PROTEIN (KELCH PROTEIN). 

FCBBF20041380//SAM domain (Sterile alpha motif) 

FCBBF20043730//UBA domain 

FCBBF20056580//Mus musculus NSD1 protein mRNA, complete cds. 

40 FCBBF20059660//TPR Domain 

FCBBF3001 9180//SERINEITHREONINE PROTEIN PHOSPHATASE 2A, 65 KDA REGULATORY SUBUNIT A, AL- 
PHA ISOFORM (PP2A, SUBUNIT A, PR65-ALPHA ISOFORM) (PP2A, SUBUNIT A, R1 -ALPHA ISOFORM). 
FCBBF30026580//Homo sapiens retinoblastoma-associated protein RAP140 mRNA, complete cds. 
FCBBF30035570//C2 domain 

45 FCBBF30079770//D-isomer specific 2-hydroxyacid dehydrogenases 

FCBBF301 00120//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete 
cds. 

FCBBF30100410//Mus musculus testis-specific Y-en coded-like protein (TspyH) mRNA, complete cds. 
FCBBF30118890//Drosophila melanogaster La related protein (larp) mRNA, partial cds. 
50 FCBBF30138000//trg protein - rat 

FCBBF30157270//Rattus norvegicus PAPIN mRNA, complete cds. 
FCBBF30161780//gag gene protein p24 (core nucleocapsid protein)// Zinc knuckle 
FCBBF30198670//dof protein - fruit fly (Drosophila melanogaster) 

FCBBF30222910//Mus musculus Rap2 interacting protein 8 (RPIPB) mRNA, complete cds. 
55 FCBBF30255680//Rattus norvegicus brain specific cortactin-binding protein CBP90 mRNA, partial cds. 
FCBBF30260210//Drosophila melanogaster KISMET-L long isoform (kis) mRNA, complete cds. 
FCBBF30282020//cca3 protein - rat 

FCBBF4000061 0//late gestation lung 2 protein [Rattus norvegicus]. 
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FEBRA20029620//Leucine Rich Repeat// Leucine Rich Repeat// Leucine Rich Repeat 

FEBRA20031150//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds. 

FEBRA20038330//Corticotropin-releasing factor family 

FEBRA20038970//Homo sapiens mRNA for stabilin-1 (stabl gene). 
5 FEBRA2008861 07/CELLULAR RETINALDE HYDE-BIN DING PROTEIN (CRALBP). 

FEBRA20150420//HYPOTHETICAL 131 .5 KDA PROTEIN C02F12.7 IN CHROMOSOME X. 

FEBRA201 75330//D- isomer specific 2-hydroxyacid dehydrogenases 

HEART10001420//Mus musculus skm-BOP1 (Bop) mRNA, complete cds. 

HLUNG20024050//Rubredoxin 
w HLUNG20030420//Mus musculus mRNA for MAIL, complete cds. 

HLUNG20030490//Ambystoma tigrinum RPE65 protein mRNA, complete cds. 

HLUNG20033060//Homo sapiens GAP-like protein (N61) mRNA, complete cds. 

HLUNG20041590//ubiquitous tetratricopeptide containing protein RoXaN [Homo sapiens]. 

HLUNG20045340//MOB2 PROTEIN (MPS1 BINDER 2). 
15 HLUNG20051330//FHIPEP family 

HLUNG20070410//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 

HLUNG200721007/Gallus gallus Dach2 protein (Dach2) mRNA, complete cds. 

HLUNG20083480//Chicken mRNA for TSC-22 variant, complete cds, clone SLFEST52. 

KIDNE20027980//SAM domain (Sterile alpha motif) 
20 KIDNE20084730//Homo sapiens FH1/FH2 domain-containing protein FHOS (FHOS) mRNA, complete cds. 

KIDNE20149780//NG28 [Mus musculus] 

KIDNE20154330//Rattus norvegicus mRNA for multi PDZ domain protein. 

KIDNE20170400//Leucine Rich Repeat// Leucine Rich Repeat// Leucine Rich Repeat// Protein kinase C terminal do- 
main// Rubredoxin 

25 KIDNE20189890//Homo sapiens mRNA for KARP-1 -binding protein 2 (KAB2), complete cds. 

LIVER20010760//Homo sapiens C-type lectin-like receptor-1 mRNA, complete cds. 

LIVER20040740//RETINAL-BINDING PROTEIN (RALBP). 

MESAN20009090//Homo sapiens CEGP1 protein (CEGP1), mRNA 

MESAN20026870//PAN domain// TBC domain 
30 MESAN20090190//CEGP1 protein [Homo sapiens]. 

NT2NE20059680//Homo sapiens integrin cytoplasmic domain associated protein (lcap-1a) mRNA, complete cds. 

NT2NE20077270//Adenovirus EB1 55K protein / large t-antigen 

NT2NE20087850//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN CEX) (FRAGMENT). 
NT2NE20095230//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds. 
35 NT2NE20108420//KES1 PROTEIN. 

NT2NE20173970//Rattus norvegicus beta-catenin binding protein mRNA, complete cds. 
NT2 N E20 1 772 10//Leish mania major partial ppgl gene for proteophosphoglycan. 
NT2RP70012830//CALPHOTIN. 

NT2RP70035110//Caenorhabditis elegans UNC-89 (unc-89) gene, complete cds. 
40 NTONG20002230//Mus musculus RW1 protein mRNA, complete cds. 

NTONG20005310//Ribosomal protein S9/S16 

NTONG20029850//CALCYPHOSINE (R2D5 ANTIGEN). 

NTONG20035150//RING CANAL PROTEIN (KELCH PROTEIN). 

NTONG20058220//Homo sapiens phosphoprotein pp75 mRNA, partial cds. 
^5 OCBBF20005220//Rattus norvegicus Fos-related antigen mRNA, complete cds. 

OCBBF20011860//Mus musculus epithelial protein lost in neoplasm-a (Eplin) mRNA, complete cds. 

OCBBF20016810//enhancer of polycomb [Mus musculus] 

OCBBF20147070//DNA polymerase (viral) C-terminal domain 

OCBBF20160380//liver stage antigen LSA-1 - Plasmodium falciparum 
50 OCBBF201 77910//Corticotropin-releasing factor family 

PEBLM20005020//Virion host shutoff protein 

PLACE60055460//Homo sapiens leucine-zipper protein FKSG13 (FKSG13) mRNA, complete cds. 
PLACE60068710//SUP PRESSOR PROTEIN SRP40. 
PLACE60080360//mucin [Homo sapiens] 
55 PLACE60082850//Pathogenesis-related protein Bet v I family 

PLACE60098350//Human hepatocellular carcinoma associated protein (JCL-1) mRNA, complete cds. 
PLACE60105680//Homo sapiens mRNA for TU12B1-TY, complete cds. 
PLACE60119700//Homo sapiens mRNA for ABP32, complete cds. 
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PLACE60120280//SER/THR-RICH PROTEIN T10 IN DGCR REGION. 

PLACE60132200//TRICHOHYALIN. 

PLACE60181870//Pentaxln family 

PROST20084470//Plasmodium berghei strain NYU2 merozoite surface protein-1 mRNA, partial cds. 
5 PROST20087240//gag gene protein p24 (core nucleocapsid protein) 
PROST20122490//Gallus gallus syndesmos mRNA, complete cds. 
PROST20130320//S-100/ICaBP type calcium binding domain 
PROST20152870//Homo sapiens APC2 gene, exon 14. 

PUAEN10001640//Mus musculus cerebellar postnatal development protein-1 (Cpd1) mRNA, partial cds. 
10 PUAEN20000800//Bleomycin resistance protein 

SMINT20012220//Collagen triple helix repeat (20 copies) 

SMINT20035510//Drosophila melanogaster La related protein (larp) mRNA, partial cds. 
SMINT20036440//Drosophila melanogaster epsin-like protein mRNA, complete cds. 
SMINT20038660//Homo sapiens HNOEL-iso (HNOEL-iso) mRNA, complete cds. 
15 SMINT20043390//Ras association (RalGDS/AF-6) domain 
SMINT20048720//Cytochrome P450// Cytochrome P450 

SMINT20052130//Rattus norvegicus mRNA for gankyrin homologue, complete cds. 

SMINT20054050//ABC1 PROTEIN HOMOLOG PRECURSOR. 

SPLEN20024770//Rattus norvegicus (rsec6) mRNA, complete cds. 
20 SPLEN20040780//CORNIFIN B (SMALL PROLINE-RICH PROTEIN 1B) (SPR1B) (SPR1 B). 

SPLEN20041810//BC-2 protein [Homo sapiens] 

SPLEN20100040//258.1 KDA PROTEIN C210RF5 (KIAA0933). 

SPLEN20104150//Ribosomal protein L36 

SPLEN20116720//Homo sapiens misato mRNA, partial cds. 
25 SPLEN20176130//Homo sapiens mRNA for ALEX1 , complete cds. 

SPLEN20181570//TRICHOHYALIN. 

TEST1 2000431 0//TRICHOHYALIN. 

TESTI20016970//TPR Domain 

TESTI20030440//TRICHOHYALIN. 
30 TESTI200431 807/mouse mRNA for megakaryocyte potentiating factor, complete cds. 

TESTI20043910//IQ calmodulin-binding motif// IQ calmodulin-binding motif// IQ calmodulin-binding motif// IQ calmod- 
ulin-binding motif// IQ calmodulin-binding motif 

TESTI20044900//Strongylocentrotus purpuratus radial spokehead mRNA, complete cds. 
TESTI20046110//Extracellular link domain 
35 TESTI20047930//Homo sapiens NY-REN-2 antigen mRNA, complete cds. 
TESTI20049410//Proprotein convertase P-domain 
TESTI20053950//IQ calmodulin-binding motif 

TESTI20054700//Streptococcus pneumoniae strain g375 surface protein PspC (pspC) gene, pspC-8.1 allele, complete 
cds. 

40 TESTI20055880//Serum amyloid A protein 

TESTI20056030'//Homo sapiens 88-kDa Golgi protein (GM88) mRNA, complete cds. 
TESTI20061090//Keratin, high sulfur B2 protein 

TESTI20064370//TPR Domain// TPR Domain// TPR Domain// TPR Domain// Synaptobrevin 
TESTI20084250//OXYSTEROL-BINDING PROTEIN. 
45 TESTI200921 70//ENV polyprotein (coat polyprotein) 
TESTI20116050//UBX domain 
TESTI20120500//Kelch motif// Kelch motif 

TESTI20126280//Mus musculus STAP mRNA for sperm tail associated protein, complete cds. 
TESTI20144390//TESTIS-SPECIFIC PROTEIN PBS13. 
50 TESTI201 65990//Ribosomal protein L36 

TESTI201 69500//HYPOTHETICAL 51.9 KDA PROTEIN C27F1.04C IN CHROMOSOME I. 
TESTI20170280//Flagellar L-ring protein 

TESTI20176450/Ahioredoxin interacting factor [Mus musculus]. 
TESTI20179230//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
55 TESTI201 80600//Homo sapiens HOM-TES-85 tumor antigen mRNA, complete cds. 

TESTI20209050//HYPOTHETICAL 113.1 KDA PROTEIN IN PRE5-FET4 INTERGENIC REGION. 

TESTI20210570//RETINAL-BINDING PROTEIN (RALBP). 

TE STI 2021 531 0//Homo sapiens calcyclin binding protein mRNA, complete cds. 
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TESTI20247440//Human BLu protein testis isoform (BLu) mRNA, complete cds. 
TESTI20249360//Homo sapiens DEME-6 mRNA, partial cds. 
TESTI20250220//TRICHOHYALIN. 

TESTI20251440//Rattus norvegicus (rsec6) mRNA, complete cds. 
5 TESTI20255460//Mus musculus mRNA for MIWI (plwi), complete cds. 
THYMU20009500//TPR Domain 

THYMU20010180//MOB1 PROTEIN (MPS1 BINDER 1). 

THYMU20013810//Human SEC7 homolog Tic (TIC) mRNA, complete cds. 

THYMU20018250//TPR Domain 
w THYMU20026950//Mus musculus ROSA 26 transcription AS ROSA26AS mRNA, complete cds. 

THYMU20028410//Mus musculus Pax transcription activation domain interacting protein PTIP mRNA, complete cds. 

THYMU20030460//Homo sapiens tumor endothelial marker 7 precursor (TEM7) mRNA, complete cds. 

THYMU20031330//Homo sapiens putative nucleotide binding protein mRNA, complete cds. 

THYMU20052460//PHORBOLIN I (FRAGMENTS). 
15 THYMU20055450//Zona pellucida-like domain 

THYMU20083830//Homo sapiens angiostatin binding protein 1 mRNA, complete cds. 

THYMU20139160//Uncharacterized protein family UPF0031 

THYMU20151610//Homo sapiens antigen NY-CO-1 (NY-CO-1) mRNA, complete cds. 
TRACH20093400//TRICHOHYALIN. 
20 TRACH20104510//Uncharacterized protein family UPF0005 
TRACH20122980//HYPOTHETICAL PROTEIN MJ0798. 
TRACH20139280//PX domain 

TRACH20164820//D-isomer specific 2-hydroxyacid dehydrogenases 

TRACH20165540//Human alpha-1 type I collagen gene surrounding osteogenesis imperfecta Ol type II deletion. 
25 UTERU20051790//guanylate kinase-interacting protein 1 Maguin-1 , membrane-associated - rat 
UTERU20083020//Domain of unknown function DUF71 
UTERU201 21 1 40//Rhodanese-like domain 

UTERU20128560/126.4 KDA PROTEIN IN RUVC-ASPS INTERGENIC REGION. 
UTERU201326207/AXONEME- ASSOCIATED PROTEIN MST101(2). 
30 UTERU20134830//pellino (Drosophila) homolog 2 [Homo sapiens] 
UTERU20181270//Zinc knuckle 

[0266] With respect to the remaining 613 clones, there are so far no information available for estimating their func- 
tions. However, there is the possibility that the functions of these clones will be revealed in future. Their Clone Names 
are indicated below. 

35 ADRGL20027530, ADRGL20040310, ADRGL20040770, ADRGL20046760, ADRGL20047080, ADRGL20057560, 
ADRGL20067320, ADRGL20095330, ASTRO20003720, ASTRO200 04820, ASTRO2001 2270, ASTRO20020350, 
ASTRO20022020, ASTRO20027330, ASTRO20047510, ASTRO200 69200, ASTRO20076660, AST RO2009 1770, 
ASTRO20141740, BNGH41 0000570, BNGH420014060, BNGH420040760, BNGH420042910, BNGH420045380, 
BNGH420061350, BNGH420062340, BNGH4200851 00, BRACE20009050, BRACE2001 7790, BRACE2001 881 0, 

40 BRACE20025820, BRACE20038920, BRACE20054480, BRACE20057870 ! BRACE20059110, BRACE20062580, 
BRACE20069440, BRACE20098860, BRACE201 96960, BRACE20200970, BRACE20205840, BRACE20207420, 
BRACE20212450, BRACE20216700, BRACE20219360, BRAMY1 0000980, BRAMY20000210, BRAMY20000250, 
BRAMY20020440, BRAMY20021580, BRAMY20023390, BRAMY20036530, BRAMY2003681 0, BRAMY20039290, 
BRAMY20043520, BRAMY20050640, BRAMY20052440, BRAMY20073080, BRAMY200741 1 0, BRAMY20074860, 

45 BRAMY20076130, BRAMY20076530, BRAMY20095080, BRAMY20095570, BRAMY201 00680, BRAMY201 07980, 
BRAMY20120170, BRAMY201 24970, BRAMY20125170, BRAMY201 26910, BRAMY201 39750, BRAMY201 55500, 
BRAMY201 59250, BRAMY201 60020, B RAM Y201 73480, BRAMY2021 9620, BRAMY20225250, BRAMY20227230, 
BRAMY20227960, BRAMY202431 20, BRAMY20245350, BRAMY20267780, BRAMY20269040, BRAMY20271140, 
BRAMY20287400, BRAWH20020600, BRAWH20025490, BRAWH20027250, BRAWH20055240, BRAWH20055780, 

50 BRAWH20058120, BRAWH20078080, BRAWH20082550, BRAWH201 73790, BRAWH201 75230, BRAWH201 75340, 
BRAWH20 182670, BRAWH201 8601 0, BRCOC1 0000400, BRHIP20003590, BRHIP20005060, BRSSN20092440, 
CTONG1 0000090, CTONG20000340, CTONG20002790, CTONG20008460, CTONG2001 5240, CTONG20020 660, 
CTONG20027660, CTONG20031 150, CTONG20031 890, CTONG20033500, CTONG20035240, CTONG20036800, 
CTONG20039370, CTONG20050490, CTONG20055670, CTONG20057750, CTONG20057950, CTONG20061 290, 

55 CTONG20062730, CTONG20065240, CTONG20073990, CTONG20074740, CTONG20 076230, CTONG20081 840, 
CTONG201 33720, CTONG201 65590, CTONG201 66580, CTONG201 68460, CTONG201 69530, CTONG201 74440, 
CTONG201 79390, CTONG201 79980, CTONG201 80620, CTONG201 81350, CTONG201 84130, CTONG201 86140, 
CTONG201 90630, DFNES20032550, DFNES20088810, FCBBF1 0002200, FCBBF20021110, FCBBF20028980, 
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FCBBF20038230, 

FCBBF30001150, 

FCBBF30063990, 

FCBBF30088700, 

FCBBF301 05080, 

FCBBF301 30580, 

FCBBF301 95700, 

FEBRA20040260, 

FEBRA20086600, 

FEBRA20101410, 

FEBRA201 80510, 

HLUNG20011440, 

HLUNG20029420, 

HLUNG20038330, 

HLUNG20065700, 

HLUNG20083960, 

KIDNE20091090, 

KIDNE20142680, 

KIDNE20191870, 

MESAN20030390, 

MESAN20094180, 

NT2NE20062880, 

NT2NE20088030, 

NT2NE201 24570, 

NT2NE20157120, 

NT2RP70056290, 

OCBBF20001780, 

OCBBF201 19810, 

PLACE60017120, 

PLACE60072420,. 

PLACE60095240, 

PLACE60162100, 

PROST20014140, 

PROST20042700, 

PROST20078710, 

PROST201 03820, 

PROST201 59320, 

SKNSH20068220, 

SMINT20023110, 

SMINT20067080, 

SMINT20089220, 

SPLEN20012450, 

SPLEN20024510, 

SPLEN20051420, 

SPLEN20087860, 

SPLEN20126110, 

SPLEN201 99850, 

TEST1 10000850, 

TESTI20034750, 

TESTI20052670, 

TESTI20060450, 

TESTI20066590, 

TESTI20083890, 

TESTI20093900, 

TESTI201 18460, 

TESTI201 34270, 

TESTI201 65680, 

TESTI201 82210, 



FCBBF20038950, FCBBF2006131 0, FCBBF20070800, FCBBF30000010, 
FCBBF30002330, FCBBF30004730, FCBBF30005180, FCBBF3001 9240. 
FCBBF30068210, FCBBF30072480, FCBBF30074530, FCBBF30074620, 
FCBBF30089380, FCBBF30091 01 0, FCBBF30099490, FCBBF301 01240, 
FCBBF301 06950, FCBBF301 07330, FCBBF30114180, FCBBF301 15230, 
FCBBF30151190, FCBBF301 70710, FCBBF30179180, FCBBF30181 730, 
FCBBF40001920, FCBBF40005000, FCBBF50000410, FEBRA20035200, 
FEBRA20040290, FEBRA20076200, FEBRA20078180, FEBRA20082660, 
FEBRA20091620, FEBRA20093270, FEBRA20093280, FEBRA20095410, 
FEBRA201 08020, FEBRA201 08580, FEBRA20121200, FEBRA201 63980, 
FEBRA201 87460, HHDPC20082970, HLUNG20009260, HLUNG20009550, 
HLUNG20012140, HLUNG20020500, HLUNG20021450, HLUNG20023030, 
HLUNG20030610, HLUNG20031620, HLUNG20033310, HLUNG20037140, 
HLUNG20047070, HLUNG20055240, HLUNG20056560, HLUNG20057380, 
HLUNG20067810, HLUNG20072190, HLUNG20072450, HLUNG20079310, 
HLUNG20093030, HLUNG20094130, K1DNE20011 600, KIDNE20024380, 
KIDNE20094260, KIDNE20095530, KIDNE201 37310, KIDNE20138450, 
KIDNE20154830, KIDNE20155980, KIDNE20157100, KIDNE201 76030, 
LIVER20007690, LIVER20032340, MESAN20008940, MESAN20021860, 
MESAN20041380, MESAN20045750, MESAN20060220, MESAN20085360, 
NESOP20005040, NT2NE2 0028700, NT2NE20033150, NT2NE200451 90, 
NT2NE20066590, NT2NE20070520, NT2NE20073650, NT2NE20077860, 
NT2NE201 04000, NT2NE201 07810, NT2NE201 12210, NT2NE201 14850, 
NT2NE201 26030, NT2NE201 40280, NT2NE201 48690, NT2NE201 49500, 
NT2NE20165190, NT2NE201 81 760, NT2NE20 184720, NT2RP70022820, 
NT2RP70074800, NT2RP700801 50, NT2RP70084540, NTONG2001 7620, 
OCBBF20009820, OCBBF201 09450, OCBBF201 1 2320, OCBBF2011 5360, 
OCBBF201 42290, OCBBF201 52330, OCBBF201 88280, PLACE60001 91 0, 
PLACE60055350, PLACE60062870, PLACE60063940, PLACE60069880, 
PLACE60077870, PLACE60081260, PLACE60088240, PLACE60092280, 
PLACE601 09910, PLACE601 32320, PLACE601 32880, PLACE601 55910, 
PLACE601 75640, PLACE601 77910, PROST1 0001 360, PROST1 00021 50, 
PROST20014650, PROST20015400, PROST20022120, PROST20036280, 
PROST20047440, PROST20048770, PROST20052720, PROST20054660, 
PROST20093470, PROST20 094000, PROST200 97310, PROST20097360, 
PROST20121570, PROST20 124000, PROST201 25420, PROST201 38730, 
SKMUS20026340, SKMUS20064810, SKNSH10001010, SKNSH20007160, 
SMINT20011830, SMINT20013970, SMINT20014610, SMINT20017310, 
SMINT20031280, SMINT20045830, SMINT20045890, SMINT20047290, 
SMINT20070620, SMINT20077920, SMINT2008491 0, SMINT2008531 0, 
SMINT20092120, SMINT20093630, SMINT20094150, SPLEN20005160, 
SPLEN20015030, SPLEN20016500, SPLEN20019120, SPLEN20020530, 
SPLEN20029170, SPLEN20036780, SPLEN20043430, SPLEN20043460, 
SPLEN20062830, SPLEN2006701 0, SPLEN200761 90, SPLEN20081640, 
SPLEN201 08460, SPLEN2011 0210, SPLEN20111450, SPLEN20114190, 
SPLEN201 37530, SPLEN201 92570, SPLEN201 93750, SPLEN201 97090, 
SPLEN20200070, SPLEN20200340, SPLEN20203590, SPLEN20205120, 
TESTI20005980, TESTI20012360, TESTI200 19590, TESTI20028020, 
TESTI20035330, TESTI20040850, TESTI20045740, TESTI20049990, 
TESTI20053800, TESTI20059330, TESTI20059370, TESTI20059790, 



TESTI20062180 
TESTI20067350 
TESTI20085670 

TESTI20094620, TESTI20097270 ; 



TESTI201 22440, TESTI201 24440, 
TESTI201 42480, TESTI201 51 050 , 



TESTI201 67580, 
TESTI201 84280, 



TESTI20062580, TESTI20064990, TESTI200661 70, 
TESTI20068940, TESTI20076920, TESTI20079060, 
TESTI20089290, TESTI200901 80, 
TESTI201 07340, 
TESTI201 25440, 
TESTI201 52490, 
TESTI201 70890, 



TEST1201 70690, 



TESTI20090970, 
TESTI20113150, 
TESTI201 32680, 
TESTI201 59380, 
TESTI20173110, 



TESTI201 84820, TESTI201 92570, TESTI201 93520, 



FCBBF30001020, 
FCBBF30056980, 
FCBBF30081000 : 
FCBBF30101300, 
FCBBF301 28420, 
FCBBF301 94370, 
FEBRA20039070, 
FEBRA20083410, 
FEBRA20098040, 
FEBRA201 75020, 
HLUNG20010130, 
HLUNG20025620, 
HLUNG20037780, 
HLUNG20059240, 
HLUNG20083840, 
KIDNE20086970, 
KIDNE20141120, 
KIDNE20181670, 
MESAN20029780, 
MESAN20089260, 
NT2NE20047870, 
NT2NE20086070, 
NT2NE201 23610, 
NT2NE201 50610, 
NT2RP70049610, 
NTONG20049180, 
OCBBF201 17220, 
PLACE60011180, 
PLACE60072390, 
PLACE60092370, 
PLACE601 57310, 
PROST20011800, 
PROST20041460, 
PROST20060200, 
PROST201 02500, 
PROST201 56360, 
SKNSH20040390, 
SMINT20021260, 
SMINT20056240, 
SMINT20085450, 
SPLEN20005370, 
SPLEN20023430, 
SPLEN20045550, 
SPLEN20087370, 
SPLEN201 17580, 
SPLEN201 97740, 
TESOP1 0001 600, 
TESTI20030610, 
TESTI20050170, 
TESTI20060150, 
TESTI20066280, 
TESTI20080460, 
TESTI20091360, 
TESTI201 17500, 
TESTI201 34010, 
TESTI20161010, 
TESTI201 79510, 
TESTI201 97290, 
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TESTI201 98600, TESTI201 99980, TESTI202001 20, TESTI20200840, TESTI202051 50, TESTI2021 1 380, 
TEST12021 9390, TESTI20221 790, TESTI20223380, TESTI20248850, TESTI20254090, TESTI20254480, 
TESTI20259200, TESTI20260140, TESTI20265150, TESTI20265340, TESTI20265890, TESTI20269250, 
TESTI20269360, TESTI20272380, TESTI20277300, TESTI20287760, THYMU20007750, THYMU20008000, 

5 THYMU20009460, THYMU20009710, THYMU20010710, THYMU2001 2560, THYMU20014430, THYMU2001 8390, 
THYMU20019000, THYMU20020370, THYMU20021540, THYMU20029830, THYMU20036500, THYMU20043440, 
THYMU20043560, THYMU20044100, THYMU20044520, THYMU20051340, THYMU20058550, THYMU20060480, 
THYMU20062520, THYMU20064680, THYMU20069130, THYMU20069460, THYMU20069650, THYMU20071460, 
THYMU20072580, THYMU20073070, THYMU200 73080, THYMU20078020, THYMU20080490, THYMU20083500, 

w THYMU20084520, THYMU20086430, THYMU20089170, THYMU20089900, THYMU20091040, THYMU201 12570, 
THYMU20115730, THYMU201 17850, THYMU201 28910, THYMU201 29020, THYMU201 30470, THYMU201 34260, 
THYMU201 40510, THYMU201 48010, THYMU201 49230, THYMU201 57620, THYMU201 74490, THYMU201 74790, 
THYMU201 75260, THYMU201 77070, THYMU20181890, THYMU201 87210, TKIDN10001710, TRACH20012490, 
TRACH20021000, TRACH20025370, TRACH20026640, TRACH20041090, TRACH20044990, TRACH20049500, 

15 TRACH20051590, TRACH20057200, TRACH20080810, TRACH20093480, TRACH20101590, TRACH201 23870, 
TRACH201 24970, TRACH201 25620, TRACH20129180, TRACH20140180, TRACH201 58240, TRACH201 60800, 
TRACH201 74980, TRACH201 82780, TRACH201 85120, UTERU1 0001 870, UTERU20000230, UTERU20011760, 
UTERU20013890, UTERU20027360, UTERU20029930, UTERU20031350, UTERU20040370, UTERU20040390, 
UTERU20040730, UTERU20041970, UTERU20065470, UTERU20079240, UTERU20090940, UTERU20091470, 

20 UTERU201 02260, UTERU201 03040, UTERU201 06510, UTERU201 40010, UTERU201 67570, UTERU201 73030, 
UTERU201 76230 



EXAMPLE 7 



25 Expression frequency analysis in silico 

[0267] The cDNA libraries derived from various tissues and cells as indicated in Example 1 were prepared, and 
cDNA clones were selected from each library at random. The 5'-end sequences were determined and the database 
was constructed based on the data. The database was constructed based on the nucleotide sequences of 770,546 

30 clones, and thus the population of the database is large enough for the analysis. 

[0268] Then, clones having a homologous sequence are categorized into a single cluster (clustering) by searching 
the nucleotide sequences of respective clones in this database with the program of nucleotide sequence homology 
search; the number of clones belonging to each cluster was determined and normalized for every library; thus, the 
ratio of a certain gene in each cDNA library was determined. This analysis gave the information of the expression 

35 frequency of genes in tissues and cells which were sources of the cDNA libraries. 

[0269] Then, in order to analyze the expression of a gene containing the nucleotide sequence of the cDNA of the 
present invention in tissues and cells, the library derived from a tissue or a cell used in the large-scale cDNA analysis 
was subjected to the comparison of the expression levels between tissues or cells. Namely, the expression frequency 
was analyzed by comparing the previously normalized values between tissues and/or cells for which the nucleotide 

40 sequences of 600 or more cDNA clones had been analyzed. By this analysis, some of the genes were revealed to be 
involved in the pathology and functions indicated below. Each value in Tables 3 to 39 shown below represents a relative 
expression frequency; the higher the value, the higher the expression level. 



Osteoporosis-related genes 

45 

[0270] Osteoporosis is a pathology in which bones are easily broken owing to overall decrease in components of 
bone. The onset involves the balance between the functions of osteoblast producing bone and osteoclast absorbing 
bone, namely bone metabolism. Thus, the genes involved in the increase of osteoclasts differentiating from precursor 
cells of monocyte/macrophage line (Molecular Medicine 38. 642-648. (2001)) are genes involved in osteoporosis rel- 

50 evant to bone metabolism. 

[0271] A nucleotide sequence information-based analysis was carried out to identify the genes whose expression 
frequencies are higher or lower in CD34+ cell (cell expressing a glycoprotein CD34) treated with the osteoclast differ- 
entiation factor (Molecular Medicine 38. 642-648. (2001)) than in the untreated CD34+ceII, which is the precursor cell 
of monocyte/macrophage line. The result of comparative analysis for the frequency between the two cDNA libraries 

55 prepared from the RNA of CD34+ cells (CD34C) and from the RNA of CD34+ cells treated with the osteoclast differ- 
entiation factor (D30ST, D60ST or D90ST) showed that the genes whose expression levels were different between the 
two were the following clones (Table 3). 

ASTRO20010290, BRAMY20036530, BRAMY20043630, BRAMY20089770, BRAMY201 90550, CD34C20001750, 
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FCBBF20066340, FEBRA20040290, HLUNG20015180, HLUNG20041590, HLUNG20052300, KIDNE20084040, 
MESAN20021860 ! MESAN20027240, NTONG20055200, PROST2001 6760, PUAEN1 0001 640, SMINT20006020, 
SMINT20028840, SMINT20035050, SPLEN201 81570, TESTI20064530, TESTI20210030, THYMU20029830, 
THYMU20139160, TRACH20051590 
5 [0272] These genes are involved in osteoporosis. 

Genes involved in neural cell differentiation 

[0273] Genes involved in neural cell differentiation are useful for treating neurological diseases. Genes with varying 
10 expression levels in response to induction of cellular differentiation in neural cells are thought to be involved in neuro- 
logical diseases. 

[0274] A survey was performed for genes whose expression levels are varied in response to induction of differenti- 
ation (stimulation by retinoic acid (RA) or growth inhibitor treatment after RA stimulation) in cultured cells of a neural 
strain, NT2. The result of comparative analysis of cDNA libraries derived from undifferentiated NT2 cells (NT2RM) and 
15 the cells subjected to the differentiation treatment (NT2RP, NT2RI or NT2NE) showed that the genes whose expression 
levels were different between the two were the following clones (Table 4). 

ADRGL20023920, ASTR020009140, BNGH420077980, BNGH420086030, BRACE20062580, BRACE20079370, 
BRACE20215410, BRAMY2 00 03540, BRAMY20043630, BRAMY20076130, BRAMY20095080, BRAMY20227860, 
BRAWH20082550, BRHIP1 0001 040, BRSSN2000561 0, CTONG20027660, CTONG20044230, CTONG20066110, 

20 CTONG20079590, CTONG20084660, CTONG201 33720, CTONG201 65750, CTONG201 88080, FCBBF20023490, 
FCBBF20033360, FCBBF20059660, FCBBF20070950, FCBBF30004340, FCBBF30095410, FCBBF30 125460, 
FCBBF30179180, FCBBF30236670, FCBBF30257370, FCBBF50000610, FCBBF50001650, FEBRA20038330, 
FEBRA20039260, FEBRA20063720, FEBRA20090220, FEBRA201 50420, HEART1 0001490, HLUNG20032460, 
HLUNG20041590, KIDNE20089870, MESAN2001 6270, MESAN20021860, MESAN20060430, MESAN200 67430, 

25 NT2NE20018740, NT2NE2001 8890, NT2NE20021 860, NT2NE20026200, NT2NE2002651 0, NT2NE20028700, 
NT2NE20033150, NT2NE20037050, NT2NE20038870, NT2NE20039210, NT2NE20042550, NT2NE20045190, 
NT2NE20047870, NT2NE20053230, NT2NE20053950, NT2NE2005921 0, NT2NE20059680, NT2NE20060750, 
NT2NE20061030, NT2NE20062880, NT2NE20064780, NT2NE20066590, NT2NE20069580, NT2NE20070520, 
NT2NE20073650, NT2NE20077250, NT2NE20077270, NT2NE20077860, NT2NE20079670, NT2NE20080770, 

30 NT2NE20082130, NT2NE20082600, NT2NE20086070, NT2NE20087270, NT2NE20087850, NT2NE20088030, 
NT2NE20092950, NT2NE20095230, NT2NE20 104000, NT2NE201 0781 0, NT2NE201 08420, NT2NE20111190, 
NT2NE20112210, NT2NE20114850, NT2NE2011 7580, NT2NE20119980, NT2NE2012361 0, NT2NE201 24570, 
NT2NE201 26030, NT2NE201 27900, NT2NE20140130, NT2NE20 140280, NT2NE201 41 040, NT2NE20 145250, 
NT2NE201 46510, NT2NE201 48690, NT2NE20 149500, NT2NE201 50610, NT2NE201 52620, NT2NE20 153620, 

35 NT2NE201 55650, NT2NE20157120, NT2NE201 65190, NT2NE201 67660, NT2NE201 73970, NT2NE201 7721 0, 
NT2NE201 81 760, NT2NE201 81 800, NT2NE201 84720, NT2RI2001 6240, NT2RI20021 200, NT2RI20033920, 
NT2RI20093010, NT2RP70001120, NT2RP70001 730, NT2RP70003110, NT2RP70012830, NT2RP70022820, 
NT2RP70027790, NT2RP70029780, NT2RP 70 030840, NT2RP70031070, NT2RP70031340, NT2RP70031480, 
NT2RP70035110, NT2RP7004641 0, NT2RP7004961 0, NT2RP70056290, NT2RP70056690, NT2RP70057500, 

40 NT2RP70064570, NT2RP70074800, NT2RP 70 075300, NT2RP70075800, NT2RP70080150, NT2RP70084540, 
NT2RP70087140, NT2RP70090870, OCBBF20001 780, OCBBF20009820, OCBBF201 42290, OCBBF201 55030, 
OCBBF201 75360, OCBBF201 77540, OCBBF201 7791 0, P LAC E60 054820, PLACE60061370, PLACE60073090, 
PLACE60162100, PROST20011800, PROST20045700, PROST2007871 0, PROST20094000, PUAEN10000650, 
PUAEN1 0001 640, SKNMC20006350, SMINT2001 61 50, SMINT20030740, SMINT20035510, SMINT20039050, 

45 SMINT20047290, SPLEN20063250, SPLEN201 1 7580, SPLEN20125230, TESTI20030610, TESTI20043910, 
TESTI20066280, TESTI20067480, TESTI201 051 30, TESTI201 061 70, TESTI201 431 80, TESTI20221 790, 
TESTI20254090, TESTI20274960, THY MU1 0004280, THYMU20007020, THYMU201 04480, THYMU201 391 60, 
TRACH20026640, UTERU1 0001 060, UTERU20026620, UTERU20079240, UTERU20083020, UTERU20 102260, 
UTERU201 32620 

50 [0275] These genes are neurological disease-related genes. 

Cancer-related genes 

[0276] It has been assumed that, distinct from normal tissues, cancer tissues express a distinct set of genes, and 
55 thus the expression can contribute to the carcinogenesis in tissues and cells. Thus, the genes whose expression 
patterns in cancer tissues are different from those in normal tissues are cancer-related genes. Search was carried out 
for the genes whose expression levels in cancer tissues were different from those in normal tissues. 
[0277] The result of comparative analysis of cDNA libraries derived from breast tumor (TBAES) and normal breast 
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(BEAST) showed that the genes whose expression levels were different between the two were the following clones 
(Table 5). 

CTONG20070780, CTONG20084660, HLUNG20045340, TESTI20047370 

[0278] The result of comparative analysis of cDNA libraries derived cervical tumor (TCERX) and normal cervical duct 
5 (CERVX) showed that the genes whose expression levels were different between the two were the following clones 
(Table 6). 
SMINT20030740 

[0279] The result of comparative analysis of cDNA libraries derived from colon tumor (TCOLN) and normal colon 
(COLON) showed that the genes whose expression levels were different between the two were the following clones 
10 (Table 7). 

UTERU2004037 

[0280] The result of comparative analysis of cDNA libraries derived from esophageal tumor (TESOP) and normal 
esophagus (N ESOP) showed that the genes whose expression levels were different between the two were the following 
clones (Table 8). 

15 HLUNG20015180, NESOP20004520, NESOP20005040, TESOP 10000350, TESOP1 0001600, THYMU20071120 
[0281] The result of comparative analysis of cDNA libraries derived from kidney tumor (TKIDN) and normal kidney 
(KIDNE) showed that the genes whose expression levels were different between the two were the following clones 
(Table 9). 

ASTRO20009140, ASTRO20027330, ASTRO20055930, BGGI1 2001 0750, BNGH420074600, BRACE20050870, 

20 BRACE20054480, BRACE20062580, BRACE20219360, BRAMY20003540, BRAMY20003880, BRAMY20043630, 
BRAMY20055760, BRAMY201 25360, BRAMY201 90550, BRAMY20204270, BRAMY20227860, BRAWH20014590, 
BRAWH20093070, BRHIP1 0001040, CTONG20033750, CTONG20039370, CTONG20045500, CTONG20079590, 
FCBBF20023490, FCBBF30004340, FCBBF301 06950, FCBBF3011 5230, FCBBF301 69280, FCBBF30225930, 
FCBBF30282020, FEBRA20038330, FEBRA20039260, FEBRA20040290, FEBRA2008266O, FEBRA201 21200. 

25 FEBRA201 70240, HEART1 0001490, HLUNG20041590, HLUNG20068120, HLUNG20072450, HLUNG20083480, 
HLUNG20083960, KIDNE20011600, KIDNE20016360, KIDNE20024380, KIDNE20027980, KIDNE20080690, 
KIDNE20081170, KIDNE20083150, KIDNE20083620, KIDNE20084030, KIDNE20084040, KIDNE20084730, 
KIDNE20084800, KIDNE20086490, KIDNE20086660, KIDNE20086970, KIDNE20087880, KIDNE20088240, 
KIDNE20089870, KIDNE20091090, KIDNE20094260, KIDNE20094670, KIDNE20095530, KJDNE20133460, 

30 KIDNE20133880, KIDNE20134130, KIDNE201 34890, KIDNE20137310, KIDNE201 38450, KIDNE201 40870, 
KIDNE20141120, KIDNE20141700, KIDNE201 42680, KIDNE201 42680, KIDNE201 42900, KIDNE201 43200, 
KIDNE20147170, KIDNE201 48080, KIDNE20149780, KIDNE201 50730, KIDNE201 52440, KIDNE201 54330, 
KIDNE20154830, KIDNE201 55980, KIDNE20157100, K1DNE201 60360, KIDNE201 60960, KIDNE20163710, 
KIDNE201 65390, KIDNE201 691 80, KIDNE201 70400, KIDNE201 73150, KIDNE201 73430, KIDNE201 76030, 

35 KIDNE20181670, KIDNE201 82540, KIDNE201 861 70, KIDNE201 88630, KIDNE201 89890, KIDNE201 89960, 
KIDNE20191870, MESAN20038520, MESAN20041380, OCBBF20016390, OCBBF201 42290, OCBBF201 74890, 
PLACE60061370, PLACE60073090, PLACE60181 870, PROST2001 6760, PUAEN 10000650, SMINT20039050, 
SMINT20089210, SPLEN2001 761 0, SPLEN20024930, SPLEN20057830, SPLEN20063250, SPLEN20126110, 
SPLEN201 35030, SPLEN201 36700, TESTI 20070740, TESTI202621 50, THYMU20009500, THYMU2001 9260, 

40 THYMU201 57620, TKIDN1 0000620, TKIDN1 0001 71 0, TKIDN1 0001 920, TRACH20011 01 0, UMVEN1 0001 380 

[0282] The result of comparative analysis of cDNA libraries derived from liver tumor (TLIVE) and normal liver (LIVER) 
showed that the genes whose expression levels were different between the two were the following clones (Table 10). 
CTONG20069320, FCBBF30236670, FEBRA20038220, FEBRA20039260, KIDNE20087880, LIVER20006260, 
LIVER20007690, LIVER20007750, LIVER20010510, LIVER20010760, LIVER2001 0990, LIVER20011640, 

45 LIVER20013890, LIVER20026440, LIVER20030650, LIVER20032340, LIVER20038000, LIVER20040740, 
LIVER20055270, MESAN20027240, NT2RI20021 200, SKMUS20006790, TESTI20035330, THYMU1 0004280, 
THYMU20029830 

[0283] The result of comparative analysis of cDNA libraries derived from lung tumor (TLUNG) and normal lung 
(HLUNG) showed that the genes whose expression levels were different between the two were the following clones 
50 (Table 11). 

HLUNG20052300, S M I NT2 0035050, HLUNG20041590, PROST2001 6760, BRAMY20043630, HLUNG2001 51 80, 
THYMU20139160, HLUNG20020850, HLUNG20032460, BRAMY20204270, BRAMY20001510, B RAM Y2022 7860, 
CTONG20029030, CTONG201 68460, CTONG201 86290, FEBRA20039260, FEBRA20078800, FEBRA201 63980, 
HCHON20000870, HLUNG20008460, HLUNG20009260, HLUNG20009550, HLUNG2001 01 30, HLUNG2001 1260, 
55 HLUNG20011440, HLUNG20011460, HLUNG20012140, HLUNG20014590, HLUNG2001 5070, HLUNG20020500, 
HLUNG20021450, HLUNG20023030, HLUNG20024050, HLUNG20025620, HLUNG20028110, HLUNG20029420, 
HLUNG20029490, HLUNG20030420, HLUNG20030490, HLUNG20030610, HLUNG20031620, HLUNG20033060, 
HLUNG20033310, HLUNG20033350, HLUNG20034970, HLUNG200371 40, HLUNG200371 60, HLUNG20037780, 
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HLUNG20038330, HLUNG20041540, HLUNG20042730, HLUNG20045340, HLUNG20047070, HLUNG20050760 3 
HLUNG20051330, HLUNG20054790, HLUNG20055240, HLUNG20056560, HLUNG20057380, HLUNG20059240, 
HLUNG20060670, HLUNG20063700, HLUNG20065700, HLUNG20065990, HLUNG20067810, HLUNG20068120, 
HLUNG20069350, HLUNG2007041 0, HLUNG20072100, HLUNG200721 90, HLUNG20072450, HLUNG20074330, 

5 HLUNG20079310, HLUNG20081390, HLUNG20081530, HLUNG20082350 : HLUNG20083330, HLUNG20083480, 
HLUNG20083840, HLUNG20083960, HLUNG20084790, HLUNG20085210, HLUNG20088750, HLUNG20092530, 
HLUNG20093030, HLUNG20094130, KIDNE20142900, PROST20052850, SKNMC20006350, SPLEN20012450, 
TESTI20057590, TESTI20061 200, TESTI20067480, TESTI201 1 6050, THYMU1 0004280, THYMU2001 01 80, 
TRACH20011010, UTERU2001 6580, UTERU201 27030 

w [0284] The result of comparative analysis of cDNA libraries derived from ovary tumor (TOVER) and normal ovary 
(NOVER) showed the genes whose expression levels were different between the two were the following clones (Table 
12). 

KIDNE20089870, NT2RP 70 075300, TESTI201 32310 

[0285] The result of comparative analysis of cDNA libraries derived from stomach tumor (TSTOM) and normal stom- 
as ach (STOMA) showed that the genes whose expression levels were different between the two were the following clones 
(Table 13). 

BNGH420087430, BRAMY20227860, BRAWH20027250, CTONG2 01 74440, FEBRA20090220, PUAEN1 0000650, 
SMINT20023110, SMINT20030740, S M I NT2 0045890, SPLEN20048800, SPLEN20139360, TESTI2006341 0, 
TESTI201 50920, TRACH20026640, UTERU20041970 
20 [0286] The result of comparative analysis of cDNA libraries derived from uterine tumor (TUTER) and normal uterus 
(UTERU) showed that the genes whose expression levels were different between the two were the following clones 
(Table 14). 

ADRGL20020290, BRACE20038920, BRAMY20091230, B RAM Y2 00 93490, BRAMY20227860, BRHIP20005060, 
CTONG20069320, CTONG20083430, FCBBF30005360, FCBBF30257370, FEBRA20038330, FEBRA20039260, 

25 FEBRA20040260, FEBRA200781 80, FEBRA20087550, HLUNG20015070, HLUNG200151 80, MESAN2000711 0 ; 
MESAN20067430, MESAN20095800, NT2RP 70 057500, SKMUS20008730, SKNMC20006350, SMINT20035050, 
SMINT20045890, SPLEN20073880, SPLEN200 76470, SPLEN201 18050, TESTI20030610, TESTI20035330, 
TEST120057590, TESTI20059080, TESTI20105130, THYMU 10004280, THYMU20139160, UTERU1 0001060, 
UTERU1 0001 870, UTERU20000230, UTERU20000950, UTERU20011 760, UTERU20013890, UTERU2001 6580, 

30 UTERU20026620, UTERU20027360, UTERU20029930, UTERU20031350, UTERU20035770, UTERU20040150, 
UTERU20040370, UTERU20040390, UTERU20040730, UTERU20041630, UTERU20041970, UTERU20045200, 
UTERU20051790, UTERU20064120, UTERU20065470, UTERU20079240, UTERU20083020, UTERU20086530, 
UTERU20087070, UTERU20087850, UTERU20089300, UTERU20089390, UTERU20089620, UTERU20090940, 
UTERU20091470, UTERU20094830, UTERU200951 00, UTERU20099040, UTERU20099510, UTERU20101150, 

35 UTERU201 02260, UTERU20 103040, UTERU201 03200, UTERU201 04310, UTERU201 06510, UTERU201 21 140, 
UTERU201 22520, UTERU20125810, UTERU20127030, UTERU201 27150, UTERU201 28560, UTERU201 32620, 
UTERU201 34830, UTERU201 39760, UTERU20 140010, UTERU201 67570, UTERU201 68960, UTERU201 69020, 
UTERU201 73030, UTERU201 76230, UTERU20177150, UTERU201 81270, UTERU201 85220, UTERU201 88670, 
UTERU201 88840 

40 [0287] The result of comparative analysis of cDNA libraries derived from tongue cancer (CTONG) and normal tongue 
(NTONG) showed that the genes whose expression levels were different between the two were the following clones 
(Table 15). 

ADRGL20023920, BRACE20038920, BRACE20050870, BRACE20061 620, BRAMY20036530, BRAMY20076130, 
BRAMY20204270, BRAMY20267780, BRCAN20001680, CTONG1 0000090, CTONG20000340, CTONG20002790, 

45 CTONG20004120, CTONG20004520, CTONG20007660, CTONG200081 90, CTONG20008460, CTONG20015240, 
CTONG20017490, CTONG20020660, CTONG20020950, CTONG20027660, CTONG20029030, CTONG20030280, 
CTONG20031150, CTONG20031 890, CTONG20032930, CTONG20033500, CTONG2003361 0, CTONG2 0033 750, 
CTONG20035240, CTONG2 0036800, CTONG20036990, CTONG20039370, CTONG200411 50, CTONG20041 260, 
CTONG20042640, CTONG20044230, CTONG20044870, CTONG20045500, CTONG20046690, CTONG2 0049480, 

so CTONG20050490, CTONG20051100, CTONG20051450, CTONG20052780, CTONG20053990, CTONG20055670, 
CTONG20055850, CTONG20056150, CTONG20057750, CTONG20057950, CTONG200591 30, CTONG20060040, 
CTONG20061290, CTONG20062730, CTONG20063770, CTONG20063930, CTONG20065240, CTONG20065680, 
CTONG20066110, CTONG20068360, CTONG20069320, CTONG20069420, CTONG20070090, CTONG20070720, 
CTONG20070780, CTONG20070910, CTONG20071 040, CTONG20071680, CTONG20072930, CTONG20073990, 

55 CTONG20074000, CTONG2 00741 70, CTONG20074740, CTONG20076230, CTONG2007681 0, CTONG20077760, 
CTONG20078340, CTONG20079590, CTONG20080140, CTONG20081 840, CTONG20083430, CTONG20083980, 
CTONG20084020, CTONG20084660, CTONG2008521 0, CTONG201 33720, CTONG201 65590, CTONG201 65750, 
CTONG201 66580, CTONG201 67750, CTONG201 68240, CTONG201 68460, CTONG201 69040, CTONG201 69530, 
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CTONG201 70940, CTONG201 74290, CTONG201 74580, CTONG2 01 76040, CTONG201 79390, CTONG201 79890, 
CTONG201 79980, CTONG201 80620, CTONG201 80690, CTONG201 81350, CTONG201 83430, CTONG201 83830, 
CTONG20184130, CTONG201 84830, CTONG201 86140, CTONG20 186290, CTONG201 86370, CTONG201 86520, 
CTONG201 86550, CTONG201 88080, CTONG201 89000, CTONG201 90290, CTONG201 90630, FCBBF20070950, 

5 FCBBF30001100, FCBBF301 75350, FCBBF40005000, FEBRA20027070, FEBRA20038330, FEBRA20039260, 
FEBRA20040290, FEBRA20046200, FEBRA20063720, FEBRA20078800, FEBRA20090220, HCHON20000870, 
HLUNG20068120, MESAN20008150, MESAN20027900, NT2NE20153620, NT2RP70001 730, NT2RP70012830, 
NT2RP70027790, NT2RP70057500, NT2RP70064570, NT2RP70090870, NTONG20002230, NTONG20005310, 
NTONG20017620, NTONG20029850, NTONG20031580, NTONG20032100, NTONG20034540, NTONG20035150, 

10 NTONG20043080, NTONG20048440, NTONG20049180, NTONG20053630, NTONG20053730, NTONG20053910, 
NTONG20055200, NTONG20058010, NTONG20058220, OCBBF201 10730, OCBBF201 77540, OCBBF201 77910, 
PROST20016760, PROST20042700, PROST20050390, PROST200 63430, PROST201 30320, PUAEN1 0000650, 
PUAEN1 0001 640, PUAEN20003120, SKMUS20006790, SKNMC20006350, SKNSH200071 60, SMINT20030740, 
SMINT200355I0, SMINT20089210, SPLEN20024930, SPLEN20040780, SPLEN20063250, SPLEN20181570, 

15 SPLEN201 87490, TEST120047370, TESTI20057880, TESTI20064530, TESTI20079980, TESTI201 05130, 
TESTI201 18460, TESTI201 21 040, TESTI201 97290, THYMU1 0004280, THYMU20030460, THYMU20055460, 
THYMU20089900, THYMU20121040, THYMU20139160, THYMU201 45990, TRACH20011010, TRACH20090060, 
UTERU20000230, UTERU20000950, UTERU20016580, UTERU20045200, UTERU20083020 
[0288] These genes are involved in cancers. 

20 [0289] Further, there is a method to search for genes involved in development and differentiation: the expression 
frequency analysis in which the expression levels of genes are compared between developing or differentiating tissues 
and/or cells and adult tissues and/or cells. The genes involved in tissue development and/or differentiation are genes 
participating in tissue construction and expression of function, and thus are useful genes, which are available for re- 
generative medicine aiming at convenient regeneration of injured tissues. 

25 [0290] Search was carried out for the genes whose expression frequencies were different between developing and/ 
or differentiating tissues and/or cells, and adult tissues and/or cells, by using the information of gene expression fre- 
quency based on the database of the nucleotide sequences of 770,546 clones shown above. 

[0291] The result of comparative analysis of cDNA libraries derived from fetal brain (FCBBF, FEBRA or OCBBF) and 
adult brain (BRACE, BRALZ, BRAMY, BRAWH, BRCAN, BRCOC, BRHIP, BRSSN, BRSTN or BRTHA) showed that 

30 the genes whose expression levels were different between the two were the following clones (Tables 1 6 to 36). 

ADRGL20020290, ADRGL20021910, ADRGL20023920, ADRGL20046760, ADRGL20062330, ADRGL20079060, 
ASTRO20009140, ASTRO20 020240, ASTRO20027330, ASTRO20047510, ASTRO20055530, ASTRO20055570, 
ASTRO20055930, ASTRO20090680, BGGI120010750, BNGH420021 680, BNGH420 023870, BNGH420059680, 
BNGH420074600, BNGH420086030, BRACE1 000051 0, BRACE2000331 0, BRACE20007330, BRACE20009050, 

35 BRACE20014450, BRACE20017790, BRACE20018810, BRACE20025820, BRACE20038920, BRACE20050870, 
BRACE20051600, BRACE20051930, BRACE20052430, BRACE20052530, BRACE20054080, BRACE20054480, 
BRACE20054600, BRACE20055560, BRACE20057870, BRACE20059110, BRACE20059810, BRACE20061620, 
BRACE20062580, BRACE20063540, BRACE20065470, BRACE20066360, BRACE20068710, BRACE20069000, 
BRACE20069110, BRACE20069440, BRACE20079200, BRACE20079370, BRACE20097540, BRACE20098860, 

40 BRACE20099070, BRACE201 94670, BRACE20196180, BRACE201 96960, BRACE20200770, BRACE20200970, 
BRACE20204670, BRACE20205840, BRACE20207420, BRACE20212450, BRACE2021 541 0, BRACE2021 6700, 
BRACE20216950, BRACE20219360, BRAMY1 0000980, BRAMY1 0001 730, BRAMY2000021 0, BRA MY200 00250, 
BRAMY20001510, BRAMY20003540, BRAMY20003880, BRAMY20005080, BRAMY20013670, BRAMY2001 6780, 
BRAMY20020440, BRAMY20021580, BRAMY20023390, BRAMY20023640, BRAMY20024790, BRAMY20027390, 

45 BRAMY20027990, BRAMY20028530, BRAMY20028620, BRAMY20035380, BRAMY20035830, BRAMY20036530, 
BRAMY20036810, BRAMY20038980, BRAMY20039290, BRAMY20040580, BRAMY20043520, BRAMY20043630, 
BRAMY20044920, BRAMY2004521 0, BRAMY20045420, BRAMY20047560, BRAMY20050640, BRAMY20050940, 
BRAMY20051820, BRAMY20052440, BRAMY20053910, BRAMY20055760, BRAMY20056620, BRAMY20056840, 
BRAMY20063750, BRAMY20072440, BRAMY200 72870, BRAMY20073080, BRAMY2007411 0, BR AM Y20 074 860, 

50 BRAMY200761 00, BRAMY200761 30, BRAMY20076530, BRAMY20083330 ; BRAMY20083820, BRAMY20089770, 
BRAMY20091230, BRAMY20093490, BRAMY20094890, BRAMY20095080, BRAMY20095570, BRAMY20096930, 
BRAMY201 00680, BRAMY201 02900, BRA MY201 07980, BRAMY20111780, BRAMY2011 7670, BRAMY2011841 0, 
BRAMY201 18490, BRAMY20120170, BR AM Y20 123400, BRAMY201 24970, BRAMY201251 70, BRAMY201 25360, 
BRAMY201 25550, BRAMY201 26910, B RAM Y20 127310, BRAMY201 27760, B RAM Y20 134050, BRAMY201 35720, 

55 BRAMY201 37360, BRAMY201 39440, BRAMY201 39750, BRAMY201 43870. BRAMY2015251 0, BRAMY201 55500, 
BRAMY201 58550, BRAMY201 59250, BRAMY201 60020, BRAMY201 73480, BRAMY201 90550, BRAMY201 94680, 
BRAMY20204270, BRAMY20206340, BRAMY202 19620, BRAMY20221 600, BRAMY2022301 0, BRAMY20225250, 
BRAMY20225320, BRAMY20227230, BRAMY20227860, BRAMY20227960, BRAMY20231 150, BRAMY20234820, 
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BRAMY20237190, 
BRAMY20245760, 
BRAMY20271140, 
BRAWH20020600, 

5 BRAWH20039640, 
BRAWH20055780, 
BRAWH20082550, 
BRAWH201 73790, 
BRAWH20183170, 

10 BRAWH201 90550, 
BRCAN20004180, 
BRHIP1 0000720, 
BRSSN20005610, 
BRSSN20093890, 

15 CTONG20070720, 
CTONG20080140, 
CTONG201 83430, 
FCBBF20018680, 
FCBBF20032930, 

20 FCBBF20038950, 
FCBBF20061310, 
FCBBF30001100, 
FCBBF30004340, 
FCBBF30019180, 

25 FCBBF30035570, 
FCBBF30063990, 
FCBBF30074620, 
FCBBF30081000, 
FCBBF30093170, 

30 FCBBF30101240, 
FCBBF301 07290, 
FCBBF301 18670, 
FCBBF301 30410, 
FCBBF301 38000, 

35 FCBBF301 57270, 
FCBBF301 70710, 
FCBBF30179180, 
FCBBF301 95700, 
FCBBF30220050, 

40 FCBBF30230610, 
FCBBF30260210, 
FCBBF30282020, 
FCBBF50000410, 
FEBRA20007820, 

45 FEBRA20031150, 
FEBRA20038330, 
FEBRA20040290, 
FEBRA20057010, 
FEBRA20082660, 

50 FEBRA20088810, 
FEBRA20093280, 
FEBRA201 08580, 
FEBRA201 50420, 
FEBRA201 75020, 

55 FEBRA20191720, 
HLUNG20014590, 
HLUNG20032460, 
HLUNG20081390, 



BRAMY20238630, 
BRAMY20251210, 
BRAMY20274510, 
BRAWH20021910, 
BRAWH20040680, 
BRAWH20058120, 
BRAWH20082920, 
BRAWH201 74330, 
BRAWH201 85260, 
BRAWH20191980, 
BRCAN20005230, 
BRHIP10001040, 
BRSSN20005660, 
CTONG20032930, 
CTONG20071040, 
CTONG20085210, 
CTONG201 86370, 
FCBBF20020440, 
FCBBF20033360, 
FCBBF20041380, 
FCBBF20066340, 
FCBBF30001150, 
FCBBF30004730, 
FCBBF30019240, 
FCBBF30042610, 
FCBBF30068210, 
FCBBF30075970, 
FCBBF30085560, 
FCBBF30095410, 
FCBBF30101300, 
FCBBF301 07330, 
FCBBF301 18890, 
FCBBF301 30580, 
FCBBF301 42290, 
FCBBF30161780, 
FCBBF301 71230, 
FCBBF301 79740, 
FCBBF301 97840, 
FCBBF30222910, 
FCBBF30236670, 
FCBBF30260480, 
FCBBF30285930, 
FCBBF50000610, 
FEBRA20018670, 
FEBRA20031280, 
FEBRA20038970, 
FEBRA20040560, 
FEBRA20063720, 
FEBRA20083410, 
FEBRA20090160, 
FEBRA20095410, 
FEBRA201 15930, 
FEBRA20151750, 
FEBRA201 75330, 
HCHON20002650, 
HLUNG20015070, 
HLUNG20033060, 
HLUNG20083480, 



BRAMY20243120, 
BRAMY20251750, 
BRAMY20285650, 
BRAWH20025490, 
BRAWH20047790, 
BRAWH20063010, 
BRAWH20093040, 
BRAWH201 75230, 
BRAWH201 85270, 
BRCAN 10000760, 
BRCAN20005410, 
BRHIP20000210, 
BRSSN20066440, 
CTONG20035240, 
CTONG20071680, 
CTONG201 33720, 
CTONG201 88080 
FCBBF20021110, 
FCBBF20035430, 
FCBBF20043730, 
FCBBF20070800, 
FCBBF30002270, 
FCBBF30005180, 
FCBBF30021900, 
FCBBF30048420, 
FCBBF30071500, 
FCBBF30076310, 
FCBBF30088700, 
FCBBF30099490, 
FCBBF301 05080, 
FCBBF30114180, 
FCBBF301 25460, 
FCBBF301 32050, 
FCBBF301 43550, 
FCBBF301 64510, 
FCBBF301 72330, 
FCBBF30181730, 
FCBBF301 98670, 
FCBBF30223110, 
FCBBF30250980, 
FCBBF30263080, 
FCBBF30287940, 
FCBBF50001650, 
FEBRA20026820, 
FEBRA20031810, 
FEBRA20039070, 
FEBRA20045380, 
FEBRA20076200, 
FEBRA20084750, 
FEBRA20090220, 
FEBRA20098040, 
FEBRA201 16650, 
FEBRA201 63980, 
FEBRA201 77800, 
HCHON20002710, 
HLUNG20015180, 
HLUNG20041590, 
HLUNG20085210, 



BRAMY20244490, BRAMY20245140, 
BRAMY20263000, BRAMY20267780 
BRAMY20287400, BRAWH20014590 
BRAWH20026010, BRAWH20027250 
BRAWH20050740, BRAWH20055240 
BRAWH20078080, BRAWH20078620 
BRAWH20093070, BRAWH20094900 
BRAWH201 75340, BRAWH201 76850 
BRAWH201 8601 0, BRAWH201 88750 

BRCAN10001050, BRCAN1 0001680 
BRCOC1 0000400, BRCOC20000470 

BR H I P20003590, BRH I P20005060 , 

BRSSN20074640, BRSSN20091190 
CTONG20044870, CTONG20063930 
CTONG20074170, CTONG20078340 
CTONG201 65750, CTONG201 68240 
FCBBF1 0000230, FCBBF1 0002200 

FCBBF20023490 



FCBBF20035490, 
FCBBF20054390, 
FCBBF20070950, 
FCBBF30002280, 
FCBBF30005360, 
FCBBF30022680, 
FCBBF30053300, 
FCBBF30072440, 
FCBBF30078600, 
FCBBF30089380, 
FCBBF301 00080, 
FCBBF301 05440, 
FCBBF30114850, 
FCBBF301 25880, 
FCBBF301 32660, 
FCBBF301 45670, 
FCBBF301 66220, 
FCBBF301 73960, 
FCBBF301 94370, 
FCBBF30201630, 
FCBBF30223210, 
FCBBF30255680, 
FCBBF30266510, 
FCBBF40000610, 
FCBBF50003530, 
FEBRA20027070, 
FEBRA20035200, 
FEBRA20039260, 
FEBRA20046200, 
FEBRA20078180, 
FEBRA20086600, 
FEBRA20091620, 
FEBRA20099860, 
FEBRA20121200, 
FEBRA201 70240, 



FCBBF20028980 
FCBBF20036360 
FCBBF20056580 
FCBBF30000010 
FCBBF30002330 
FCBBF30005500 
FCBBF30026580 
FCBBF30056980 
FCBBF30072480 
FCBBF30079770 
FCBBF30091010 
FCBBF30100120 
FCBBF301 05860 
FCBBF301 15230 
FCBBF301 28420 
FCBBF301 35890 
FCBBF30151190 : 
FCBBF301 69280 
FCBBF30 175350 
FCBBF30 194550 
FCBBF30212210 
FCBBF30225930 
FCBBF30257370 
FCBBF30271990 
FCBBF40001920 
FCBBF50004950 
FEBRA20029620 
FEBRA20035240 
FEBRA20040230 
FEBRA20046280 
FEBRA20078800 
FEBRA20087550 
FEBRA20092760 
FEBRA20101410 
FEBRA20121950 
FEBRA201 72230 



FEBRA201 80510, FEBRA201 82030 
HEART1 0001 490, HLUNG20008460 
HLUNG20020850, HLUNG20028110 
HLUNG20045340, HLUNG20056560 
HLUNG20094130, KIDNE20080690 



BRAMY20245350, 
BRAMY20269040, 
BRAWH20020470, 
BRAWH20030000, 
BRAWH20055330, 
BRAWH20080580, 
BRAWH20095900, 
BRAWH201 82670, 
BRAWH201 90530, 
BRCAN20001480, 
BRCOC20003600, 
BRSSN20001970, 
BRSSN20092440, 
CTONG20069320, 
CTONG20079590, 
CTONG201 70940, 
FCBBF1 0004760, 
FCBBF20029280, 
FCBBF20038230, 
FCBBF20059660, 
FCBBF30001020, 
FCBBF30003610, 
FCBBF30019140, 
FCBBF30029250, 
FCBBF30062490, 
FCBBF30074530, 
FCBBF30080730, 
FCBBF30091520, 
FCBBF301 00410, 
FCBBF301 06950, 
FCBBF301 15920, 
FCBBF301 29010, 
FCBBF301 36230, 
FCBBF30153170, 
FCBBF301 69870, 
FCBBF301 77290, 
FCBBF301 95690, 
FCBBF30215240, 
FCBBF30228940, 
FCBBF30259050, 
FCBBF30275590, 
FCBBF40005000, 
FEBRA20005040, 
FEBRA20031000, 
FEBRA20038220, 
FEBRA20040260, 
FEBRA20046510, 
FEBRA20080860, 
FEBRA20088610, 
FEBRA20093270, 
FEBRA201 08020, 
FEBRA20141980, 
FEBRA201 73330, 
FEBRA201 87460, 
HLUNG20011460, 
HLUNG20031620, 
HLUNG20068120, 
, KIDNE20084030, 
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KIDNE20086660, KIDNE20094670, KIDNE20134130, KIDNE201 38450, KIDNE201 40870, KIDNE201 49780, 
KIDNE201 70400, KIDNE201 73430, MESAN20021860, MESAN20030350, MESAN 20034440, MESAN20038520, 
MESAN20045750, MESAN20067430, MESAN20089260, MESAN20095800, NT2NE20026200, NT2NE20033150, 
NT2NE20042550, NT2NE20045190, NT2NE20053950, NT2NE20061030, NT2NE20069580, NT2NE20082130, 

5 NT2NE20082600, NT2NE20088030, NT2NE20092950, NT2NE20095230, NT2NE201 08420, NT2NE20111190, 
NT2NE201 12210, NT2NE20141040, NT2NE201 77210, NT2NE20181800, NT2RI20021200, NT2RP70001120, 
NT2RP70001730, NT2RP70012830, NT2RP70035110, NT2RP 70057500, NT2RP70075300, NT2RP70087140, 
NT2RP70090870, NTONG20002230, NTONG2001 7620, NTONG200491 80, NTONG20055200, OCBBF20000740, 
OCBBF20001780, OCBBF20005220, OCBBF20009820, OCBBF20011860, OCBBF200 12520, OCBBF2001 6390, 

10 OCBBF20016810, OCBBF201 09450, OCBBF201 09780, OCBBF201 10210, OCBBF201 10730, OCBBF20111370, 
OCBBF20111600, OCBBF201 12280, OCBBF2011 2320, OCBBF20113110, OCBBF201 15360, OCBBF201 16250, 
OCBBF201 17220, OCBBF2011 8720, OCBBF20119810, OCBBF201 20010, OCBBF201 20950, OCBBF20121910, 
OCBBF201 23200, OCBBF201 42290, OCBBF201 47070, OCBBF20 152330, OCBBF201 55030, OCBBF201 56450, 
OCBBF201 57970, OCBBF20 160380, OCBBF201 65900, OCBBF201 65910, OCBBF201 66890, OCBBF201 66900, 

15 OCBBF201 67290, OCBBF20 170350, OCBBF201 74580, OCBBF201 74890, OCBBF201 75360, OCBBF201 76650, 
OCBBF201 77540, OCBBF20 177910, OCBBF201 82060, OCBBF201 85630, OCBBF201 88280, OCBBF20191950, 
PLACE60054820, PLACE60056910, PLACE60061370, PLACE60064740, PLACE60073090, P LAC E60 120280, 
PLACE601 32200, PLACE6015051 0, PLACE601 54450, PLACE601 57310, PLACE60162100, PROST1 00021 50, 
PROST20014150, PROST2001 6760, PROST2 0024250, PROST200351 70, PROST20035830, PROST20042700, 

20 PROST20045700, PROST20050390, PROST20 054660, PROST2007871 0, PROST20094000, PROST2009731 0, 
PROST20097840, PROST201 03820, PROST2011 41 00, PROST20 130320, PROST20151370, PUAEN1 0000650, 
PUAEN1 0001 640, PUAEN200031 20, SKNMC20006350, SKNSH10001 01 0, SKNSH200071 60, SKNSH20030640, 
SKNSH20094350, SMINT20000070, SMINT20002320, SMINT20030740, SMINT20039050, SMINT20045890, 
SMINT20047290, SMINT20048720, SMINT20056240, SMINT20077920, SMINT20088690, SMINT2008921 0 : 

25 SMINT20089600, SMINT20094150, SPLEN200051 60, SPLEN20005370, SPLEN20012450, SPLEN20024930, 
SPLEN20040780, SPLEN20048800, SPLEN20055600, SPLEN20057830, SPLEN20063250, SPLEN20071820, 
SPLEN20073880, SPLEN20076470, SPLEN201 04690, SPLEN20114190, SPLEN201 25230, SPLEN201 35030, 
SPLEN201 36700, SPLEN201 75920, SPLEN20181570, SPLEN201 83020, SPLEN201 87490, SPLEN201 93490, 
SPLEN201 93790, SPLEN201 97740, SPLEN20200070, SPLEN20200340, TESOP1 0000350, TESTI20005980, 

30 TESTI20030440, TESTI2003061 0, TESTI20031410, TESTI20035330, TESTI20047370, TESTI20050400, 
TESTI20050720, TESTI20053780, TEST 1 20057430, TESTI20057590, TESTI20057840, TESTI20057880, 
TESTI20059080, TESTI20061 200, TESTI20062580, TESTI2006341 0, TESTI20064530, TESTI20066280, 
TESTI20067480, TESTI20071 630, TESTI20079980, TESTI20081890, TESTI20089290, TESTI20090180, 
TESTI201 051 30, TESTI201 061 70, TESTI201 21 040, TESTI201 50920, TESTI201 69500, TESTI201 93080, 

35 TESTI20215310, TESTI20221790, TESTI20245860, TESTI20252690, TESTI20254090, TESTI20261160, 
TESTT20262150, TESTI20274960, THYMU20007750, THYMU20009460, THYMU2000971 0, THYMU2001 9260, 
THYMU20028410, THYMU20030460, THYMU20031330, THYMU20043440, THYMU200441 00, THYMU20044520, 
THYMU20049060, THYMU20055460, THYMU20055740, THYMU20071120, THYMU20078020, THYMU20089900, 
THYMU20091040, THYMU201 04480, THYMU201 20240, THYMU201 391 60, THYMU20 143230, THYMU20150190, 

40 THYMU201 57620, THYMU2017601 0, TKIDN1 0001 920, TRACH20012490, TRACH20021 000, TRACH20026640, 
TRACH20058000, TRACH20090060, TRACH201 59390, UMVEN1 0001380, UTERU10001060, UTERU20000230, 
UTERU20000950, UTERU20026620, UTERU20041970, UTERU20065470, UTERU20079240, UTERU20083020, 
UTERU20089300, UTERU20089390, UTERU200951 00, UTERU201 02260, UTERU201 03200, UTERU201 27150, 
UTERU201 28560 

45 [0292] The result of comparative analysis of cDN A libraries derived from fetal heart ( FEHRT) and adu It heart (H E ART) 
showed that the genes whose expression levels were different between the two were the following clones (Table 37). 
BRAMY20043630, BRAMY20072870, BRAMY20227860, BRAWH20093070, BRCAN1 0001 680, FCBBF30053300, 
FEBRA20078800, FEBRA20090220, HCHON20000870, HEART1 0001420, HEART1 0001490, HEART20009590, 
HEART20019310, HEART20022200, HEART20031680, HEART20047640, HEART20063100, HEART20082570, 

50 HLUNG20083960, PLACE60088240, PLACE601 20280, PROST2001 6760, PROST20035170, PROST20062820, 
PROST20 127450, SKMUS20006790, SKMUS20008730, TESTI20270130 

[0293] The result of comparative analysis of cDNA libraries derived from fetal kidney (FEKID) and adult kidney 
(KIDNE) showed that the genes whose expression levels were different between the two were the following clones 
(Table 38). 

55 ASTRO20009140, BGGI1 2001 0750, BRACE20054480, BRACE20062580, BRACE2021 9360, BRAMY20001 51 0, 
BRAMY20003540, BRAMY20003880, BRAMY20043630, BRAMY20204270, CTONG20033750, CTONG20039370, 
CTOIMG20045500, FCBBF20023490, FEBRA20039260, FEBRA20040290, HEART1 0001 490, HLUNG2004159O, 
HLUNG20068120, HLUNG20072450, HLUNG20083960, KIDNE20011 600, KIDNE20016360, KIDNE20024380, 
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KIDNE20027980, KIDNE20080690, KIDNE200811 70, KIDNE20083150, KIDNE20083620, KIDNE20084030, 
KIDNE20084040, KIDNE20084730, KIDNE20084800, KIDNE2008B490, KIDNE20086660, K1DNE20086970, 
KIDNE20087880, KIDNE20088240, KIDNE20089870, KIDNE20091090, KIDNE20094260, KIDNE20094670, 
KIDNE20095530, KIDNE20133460, KIDNE201 33880, KIDNE20134130, KIDNE201 34890, KIDNE20137310, 

5 KIDNE20138450, KIDNE201 40870, KIDNE20141120, KIDNE20141700, KIDNE201 42680, KIDNE201 42900, 
KIDNE201 43200, KIDNE201471 70, KIDNE201 48080, KIDNE20149780, KIDNE201 50730, KIDNE201 52440, 
KIDNE201 54330, KIDNE201 54830, KIDNE20155980, KIDNE201571 00, KIDNE201 60360, KIDNE201 60960, 
KIDN E201 6371 0, KIDNE201 65390, KIDNE201 691 80, KIDNE201 70400, KIDNE201 731 50, KIDNE201 73430, 
KIDNE201 76030, KIDN E201 81 670, KIDNE201 82540, KIDNE201 861 70, K1DNE201 88630, KIDNE201 89890, 

10 KIDNE201 89960, KIDNE20191 870, OCBBF201 74890, PLACE60073090, PLACE60181870, PROST2001 6760, 
PUAEN1 0000650, SKNMC20006350, SPLEN20017610, SPLEN20063250, SPLEN2012611 0, SPLEN201 35030, 
TESTI20061200, TESTI202621 50, THYMU1 0004280, THYMU20139160, TRACH20011010 

[0294] The result of comparative analysis of cDNA libraries derived from fetal lung (FELNG) and adult lung (HLUNG) 
showed that the genes whose expression levels were different between the two were the following clones (Table 39). 

15 BRAMY20001510, BRAMY20043630, BRAMY20204270, BRAMY20227860, CTONG20029030, CTONG201 68460, 
CTONG201 86290, FEBRA20039260, FEBRA20078800, FEBRA20 163980, HCHON20000870, HLUNG20008460, 
HLUNG20009260, HLUNG20009550, HLUNG20010130, HLUNG20011 260, HLUNG20011440, HLUNG20011460, 
HLUNG20012140, HLUNG20014590, HLUNG20015070, HLUNG200151 80, HLUNG20020500, HLUNG20020850, 
HLUNG20021450, HLUNG20023030, HLUNG20024050, HLUNG20025620, HLUNG20028110, HLUNG20029420, 

20 HLUNG20029490, HLUNG20030420, HLUNG20030490, HLUNG2003061 0, HLUNG20031 620, HLUNG20032460, 
HLUNG20033060, HLUNG2003331 0, HLUNG20033350, HLUNG20034970, HLUNG20037140, HLUNG200371 60, 
HLUNG20037780, HLUNG20038330, HLUNG20041540, HLUNG20041590, HLUNG20042730, HLUNG20045340, 
HLUNG20047070, HLUNG20050760, HLUNG20051330, HLUNG20052300, HLUNG20054790, HLUNG20055240, 
HLUNG20056560, HLUNG20057380, HLUNG20059240, HLUNG20060670, HLUNG20063700, HLUNG20065700, 

25 HLUNG20065990, HLUNG2006781 0, HLUNG200681 20, HLUNG20069350, HLUNG2007041 0, HLUNG20072100, 
HLUNG20072190, HLUNG20072450, HLUNG20074330, HLUNG2007931 0, HLUNG20081 390, HLUNG20081530, 
HLUNG20082350, HLUNG20083330, HLUNG20083480, HLUNG20083840, HLUNG20083960, HLUNG20084790, 
HLUNG20085210, HLUNG20088750, HLUNG20092530, HLUNG20093030, HLUNG20094130, KIDNE201 42900, 
PROST20016760, PROST20052850, SKNMC20006350, SMINT20035050, SPLEN20012450, TESTI20057590, 

30 TESTI20061200, TESTI20067480, TESTI201 16050, THYMIM 0004280, THYMU200101 80, THYMU20139160, 
TRACH20011010, UTERU20016580, UTERU201 27030 
[0295] These genes are involved in regeneration of tissues and/or cells. 



35 



EXAMPLE 8 



Expression frequency analysis by PCR 



[0296] Specific PCR primers were prepared based on the full-length nucleotide sequences, and the expression fre- 
quency was analyzed by the ATAC-PCR method (Adaptor-tagged competitive PCR method: Nucleic Acids Research 

40 1997, 25(22): 4694-4696; "DNA Micro-array and Advanced PCR Techniques", Cell Technology, supplement, Eds., 
Muramatsu and Nawa (Shujunsha, 2000): 104-112). Inflammation-related genes can be identified by revealing the 
genes whose expression levels are altered depending on the presence of an inflammation-inducing factor. Then, by 
using THP-1 cell line, which is a cell line of monocyte line, andTNF-a, which is an inflammation-inducing factor, suitable 
for this system, the genes whose expression levels are altered depending on the presence of the factor were searched 

45 for by the system. 

[0297] THP-1 cell line (purchasedfrom DAINIPPON PHARMACEUTICAL) was cultured, to beconfluent in RPMI1 640 
medium (sigma) containing 5% fetal calf serum (GIBCO BRL). Then, the medium was changed with the medium con- 
taining 1 0 ng/ml TNF-a (human recombinant TNF-a; Pharmacia Biotech), and the culture was continued at 37°C under 
5% CO a . After three hours, the cells were harvested, and total RNA was extracted from them by using ISOGEN reagent 
50 (Nippon Gene). The extraction was carried out according to the method in the document attached to ISOGEN reagent. 
In addition, total RNA was also extracted from the cells cultured without stimulation of TNF-a. 

[0298] The genes involved in the onset of gastritis and gastroduodenal ulcer induced by the infection of Helicobacter 
pylorito the epithelia of stomach can be identified by revealing the genes whose expression levels are altered depending 
on co-culturing the cells with Helicobacter pylori. A recent study has suggested that various substances derived from 
55 Helicobacter pylon trigger the inflammation reaction. In particular, the members belonging to the family of genes called 
"cag pathogenicity island (cag PAl)" contribute to the activation of the NF-kB pathway (Gastroenterology 2000, 119: 
97-108). Further, it has been found that cag PAl is involved in the onset of gastritis and the like by the study using an 
animal model (Journal of Experimental Medicine 2000, 192:1 601-1610). Then, by using co-culture of a gastric cancer 
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cell line with cag PAI-positive Helicobacter pylori (TN2), suitable for this system, the genes whose expression levels 
are altered depending on the presence of Helicobacter pylori were searched for by the system. Further, in order to 
study the involvement of cag PAI in the alterations of gene expression levels depending on the co-culture with Helico- 
bacter pylori, the altered expression levels were compared between the cells co-cultured with a strain of Helicobacter 
5 pylori (TN2AcagE strain) having a mutation in cagE, which is one of the cag PAI genes, and the cag PAI-positive strain 
(TN2). 

[0299] A gastric cancer cell line MKN45 (provided by the Cell Bank, RIKEN GENE BANK, The Institute of Physical 
and Chemical Research) was cultured to be confluent in RPMI1640 medium (sigma) containing 10% fetal calf serum 
(GIBCO BRL). Then, the medium was changed with the medium containing 100-fold excess (in terms of the number 

10 of cells or the number of colonies) of Helicobacter pylori (cag PAI positive strain (TN2) and cagE mutant (TN2AcagE): 
both were provided by Prof. Omata, Faculty of Medicine, The University of Tokyo), as compared with the number of 
the cancer cells. The culture was continued at 37°C under 5% C0 2 . After three hours, the cells were harvested, and 
total RNA was extracted from them by using ISOGEN reagent (Nippon Gene). The extraction was carried out according 
to the method in the document attached to ISOGEN reagent. In addition, total RNA was also extracted from the ceils 

'5 cultured without Helicobacter pylori. 

[0300] The analysis by the ATAC-PCR method was carried out basically according to "DNA Micro-array and Ad- 
vanced PCR Techniques", Cell Technology, supplement (Genome Science Series 1 , Eds., Muramatsu and Nawa (Shu- 
junsha, 2000): 104-112). Adapter ligation to the internal standard sample (sample to make the calibration curve for the 
clone of interest) and test sample was carried out in the two separate reaction systems indicated below. The combination 

20 of 6 types of adapters (AD-1 , AD-2, AD-3, AD-4, AD-5 and AD-6: see the sequences indicated below) and the samples 
are as follows. 

Reaction system A 

25 [0301] 

AD1; internal standard, 10-fold 

AD2; THP-1 cells, unstimulated 

AD3; Internal standard, 3-fold 
30 AD4; THP-1 cells, TNF-a stimulation for one hour 

AD5; THP-1 cells, TNF-a stimulation for three hours 

AD6; internal standard, 1-fold 

Reaction system B 

AD1 ; internal standard, 1-fold 
35 AD2; MKN45 cells, unstimulated 

AD3; internal standard, 3-fold 

AD4; MKN45 cells, co-cultured with TN2 (Helicobacter pylori) 
AD5; internal standard, 10-fold 

AD6; MKN45 cells, co-cultured with TN2AcagE (cagE gene mutant) Adapter sequences: 

40 

AD1 ; 

SEQ ID NO: 394 1 / /5 ' -GTACATATTGTCGTTAGAACGCG-3 ■ 
45 SEQ ID NO: 3942 / /3 ' -CATGTATAACAGCAATCTTGCGCCTAG— 5 ' 

AD 2 ; 

SEQ ID NO: 394 3//5 ' -GTACATATTGTCGTTAGAACGCGACT-3 » 

50 

SEQ ID NO: 39 4 4 //3 1 -CATGTATAACAGCAATCTTGCGCTGACTAG-5 ' 
AD3; 

55 SEQ ID NO: 39 4 5// 5 1 -GTACATATTGTCGTTAGAACGCGCATACT-3 * 

SEQ ID NO: 394 6// 3 ' -CATGTATAACAGCAATCTTGCGCGTATGACTAG-5 ' 
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AD 4 ; 

SEQ ID NO: 39 47 / / 5 • -GTACATATTGTGGTTAGAACGCGATCCATACT-3 1 

SEQ ID NO: 394 8 //3 1 ~CATGTATAACAGCAATCTTGCGCTAGGTATGACTAG~5.* 

ADS; 

SEQ ID NO: 3949//S » -GTACATATTGTCGTTAGAACGCGTCAATCCATACT-3 ■ 

SEQ ID NO: 39 50//3 1 -CATGTATAACAGCAATCTTGCGCAGTTAGGTATGACTAG-5 • 

AD 6 ; 

SEQ ID NO: 39 51//5 * -GTACATATTGTCGTTAGAACGCGTACTCAATCCATACT-3 1 



SEQ ID NO: 39S2//3 * -CATGTATAACAGCAATCTTGCGCATGAGTTAGGTATGACTAG- 
5 ' 

[0302] The internal standard sample used for this assay was a mixture of total RNAs from tissues (or culture cells; 
all from UNITECH) of brain . kidney, NT2, testis, thymus, and trachea. RNA was prepared according to the standard 
method. 

[0303] The sequences of primers specific to the genes and the names of clones of interest in the analysis are as 
follows. The gene specific primers were designed to produce the PCR products of 70 to 200 bp, which are derived 
from the adapter-containing cDNA. The sequence of adapter-specific primer (labeled with fluorescence (FAM)) used 
in the competitive PCR was GTACATATTGTCGTTAGAACGC (22 nucleotides; SEQ ID NO: 3953). PCR was basically 
carried out with a cycling profile of preheating at 94°C for 3 minutes, and 35 or 40 cycles of denaturation at 94°C for 
30 seconds/annealing at 50°C for 60 seconds/extension at 72°C for 90 seconds. 
The nucleotide sequences of clone specific primers used in the experiments 

[0304] Clone name, primer sequence and SEQ ID NO are indicated below in this order. Each is demarcated by a 
double slash mark (if). 



ADRGL2 0 036380/ / CTACTCAAGGACAGCCACAC / / SEQ 


ID 


NO: 


3954 


ASTRO? 0045840// GGATGTAGTGGGAAACAATG //SEQ 


ID 


NO: 


3955 


ASTR02 0055930/ /TGCTTTTCATTCTCCTTAGT/ /SEQ 


ID 


NO: 


3956 


ASTR02 0088950// TACGTGCTCATTTACTTGGT/ / SEQ 


ID 


NO: 


3957 


BNGH42 00523 50 / /GCCAGTTTCTTTATGATTGA// SEQ 


ID 


NO: 


3958 


BRACE20052530//AATGACTTCGTTAGGATGCC// SEQ 


ID 


NO: 


3959 


BRACE200 54080//GCTGTTGACTTCATTTGGAA//SEQ 


ID 


NO: 


3960 


BRAMY2 0003880/ / TTGGTACTTATTCTGAGGCA/ /SEQ 


ID 


NO: 


3961 


BRAMY2 0027390/ / GATTTTAGTGAAACATGCCA// SEQ 


ID 


NO: 


3962 


BRAMY2 0 028530/ / TTAAAACTG AGG ACATTCTG/ / SEQ 


ID 


NO: 


3963 


BRAMY20 03538 0//AGAGAAGGCAGTCTAGCTTA/ /SEQ 


ID 


NO: 


3964 


BRAMY20 036530/ / AGGGTATGGTAACTTCTGCA/ /SEQ 


ID 


NO: 


3965 


BRAMY2 0050940/ / AAAGGAGGGACTAGAAAACT/ /SEQ 


ID 


NO: 


3966 


BRAMY 2 0072440/ / ACT ATGACGAGGGAACAAGA/ /SEQ 


ID 


NO: 


3967 
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BRAMY 20096930// G AGG AG AACACAAGTATGGT / / SEQ 


ID 


NO; 


3968 


BRAMY20 118410/ / AAGGTCACTTCTAAACACAC/ /SEQ 


ID 


NO: 


3969 


BRAMY20237190//GGAGTGATTCAGGAGATGTG//SEQ 


ID 


NO; 


3970 


BRAWH2005533 0//GCAACAGAGACTTTATTGGT//SEQ 


ID 


NO: 


3971 


BRAWH 20078620// GAGAGACTTATCAC AGCCAT //SEQ 


ID- 


NO: 


3972 


BRAWH2 0 19 0530/ /ATGGGATTCTGTGACTTCTC/ /SEQ 


ID 


NO: 


3973 


BRCAN2 0001480/ / C AGCAACAGTAATGGGAATT/ / SEQ 


ID 


NO: 


3974 


BRHIP10 00072 0//AGGTTAGGATTTCTTTAGCA/ / SEQ 


ID 


NO: 


3975 


BRH IP10001040// T ACTTGGAGAC AACAGGGAG/ / SEQ 


ID 


NO: 


3976 


BRHIP20000210//GTGTTTGTGGGCATAGACAT//SEQ 


ID 


NO: 


3977 


BRSSN20 00197 0//AATGTATTCAGTTCCTTTCC//SEQ 


ID 


NO: 


3978 


BRSSN2 0091190/ /GTGTCATCACTAGCACCAAG/ / SEQ 


ID 


NO: 


3979 


CD34C2 0001750/ / TGGACTTAGGG ACCTGACTC / /SEQ 


ID 


NO: 


3980 


CTONG20078340 / /CTCTTTACCTAGTTTGGTCA//SEQ 


ID 


NO: 


3981 


CTONG2 0 079590// TACTTATTTTCAC AGGGGCC/ /SEQ 


ID 


NO: 


3982 


CTONG20 083980// CAGCATTTTCCTATATAGCC/ / SEQ 


ID 


NO: 


3983 


CT0NG2 0085210// CC AG AAG AGT AGC AAGAATT/ / SEQ 


ID 


NO: 


3984 


DFNES20063460//CTATTTTAACCCCTGCCCTC//SEQ 


ID 


NO: 


3985 


DFNES20 072990//GGAGGTATCTATTAGGGTGA//SEQ 


ID 


NO: 


3986 


FCBB F 2 0 0 2 9 2 8 0 / / G AC T G AGATGAACTG GAAG A //SEQ 


ID 


NO: 


3987 


FCBBF2 0 032930 / /TCACAATACAGTCCCCTAGT/ / SEO 


ID 


NO: 


3988 


FCBBF2 0036360/ / ATTTGTATCACTTTGGTGCA/ / SEQ 


ID 


NO: 


3989 


FCBBF3002268 0//CTCCAGAAAATGCATGAATC/ /SEO 


ID 


NO: 


3990 


FCBBF300 78600//CTTCAACAGTGCTTTTCCTT// SEQ 


ID 


NO: 


3991 


FCBBF30105080//CTGTGCACCCACTCTTTATT//SEQ 


ID 


NO: 


3992 


FCBRF ^0 1 fi9 fi 70/ /TCCAGTATTTTCCACTTTGA/ / SEO 


ID 


NO : 


3993 


f LdDr jy^^jj JV/ / /i^ I/IX X J. iniool UAtouUL. / / OZt^> 


TD 

x U 


MO • 


Q Q 4 

«J 17 .7 *S 


FCBBF50000610//AGTTAACGTATCTGGCAAAG//SEQ 


ID 


NO: 


3995 


FEBRA2 0007820 / /GTTTCTCACTGTCCTGTrTT / / SEQ 


ID 


NO: 


3996 


FEBRA2 003128 0 // ACTATTTTATGGTCACGGCC / / SEQ 


ID 


NO: 


3997 


FEB RA2 0031810// TGCAATC ATC TC TGTATCCC //SEQ 


ID 


NO: 


3998 


FEBRA2 0039260 //GTCAGAACCCACTTCACATC / /SEQ 


ID 


NO: 


3999 


FEBRA20046280//TCTCTGTCCrGTTGTCTAAG//SEQ 


ID 


NO: 


4000 


FEBRA2 0084750/ / TTAG C ATGTACTGGGAAAGC //SEQ 


ID 


NO: 


4001 


FEBRA2 0182030 / / AAAACACAAAATGACACCCC/ / SEQ 


ID 


NO: 


4002 


HLUNG2 0041540/ / AAAGTTCCTCTGCATTCACC/ / SEQ 


ID 


NO: 


4003 
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HLUNG20 092530/ / TTTTCATCCCAGAGTT ATTA/ / SEQ 


ID 


NO: 


4004 


KIDNE200 840 3 0 / / AGGGAATAACTTGCAGCTTG/ / SEQ 


ID 


NO: 


4005 


KIDNE200 84800/ /GTAATGTAGGGAGACTGCCG/ / SEQ 


ID 


NO: 


4006 


KIDNE2 01 3 4 1 3 0/ / AATCCCCTCTTTTGTCTCAT/ / SEQ 


ID 


NO; 


4007 


KIDNE20 182540/ / ACAGATAGCCTGGATTGAAA/ / SEQ 


ID- 


NO: 


4008 


KIDNE 20 186170// TTGTATCTGAGCTGGGGTTT/ / SEQ 


ID 


NO: 


4009 


KIDNE 20188630/ / CCCTACATATCTCT ACCCAT / / SEQ 


ID 


NO: 


4010 


LI VER2 0007750/ / TATTTAGAAACGC AG ACCCC/ / SEQ 


ID 


NO: 


4011 


MES AN20 02 1 220 / /TAGAAGTCAACAAAAGGCAC/ / SEQ 


ID 


NO: 


4012 


MESAN20 0 84 1 50 / /TCCATAAGGCACAGATTTGA/ / SEQ 


ID 


NO: 


4013 


NT2NE2 0 0 59 2 1 0 / / ATAATGACAATGCCAGTAGT/ / SEQ 


ID 


NO: 


4014 


NT2NE200B2130/ / TGAGGTACATCCAAATTAAA/ / SEQ 


ID 


NO: 


4015 


NT2NE 20092950 / /ATGATTACTCGGTTTCCAGA/ /SEQ 


ID 


NO: 


4016 


NT2RP 70031070// CAGTTAGTAGACAG ACGGGG/ /SEQ 


ID 


NO: 


4017 


OCBBF20012520/ /TCTGCCTGTAGTTGCCATTA/ /SEQ 


ID 


NO: 


4018 


OCBBF20 110210/ / AGGTGATAGGACTTTGTGCC/ / SEQ 


ID 


NO: 


4019 


OCBBF2 01 10730/ /TTAGATGCTCCCTAAGGTCC//SEQ 


ID 


NO: 


4020 


OCBBF2 0155030 / /GCTAAAATCGTGCATCTGTA/ / SEQ 


ID 


NO: 


4021 


OCBBF20 1 65900/ / AGTTTTGTATCTCCTTGTCA/ / SEQ 


ID 


NO: 


4022 


OCBBF20 170350/ / TAAGATGGAGTTCAGGGGAG/ / SEQ 


ID 


NO: 


4023 


OCBBF20 17 6650 / / GCACACAGGCAAATTCTAGT/ / SEQ 


ID 


NO: 


4024 


PLACE 60006300/ / TTCTGTAATAAGGGCTGTCA/ / SEQ 


ID 


NO: 


4025 


PLACE 60061370 / /TGTTCACAAATGGCATAAAA/ / SEQ 


ID 


NO: 


4026 


PROST20 011160// CTACTAACTCAACCACGCAT / / SEQ 


ID 


NO: 


4027 


PROST20 04 1 4 60/ / CCATTTACGTCACCTCTCTG/ / SEQ 


ID 


NO: 


4028 


PROST2006 5100//ACTATTTTATGGTCACGGCC//SEQ 


ID 


NO: 


4029 


PROST200 7 52 80 / / ACGTTGACTCTGATAGCCTG//SEO 


ID 


NO : 


4 030 


PROST20 106060/ /AATTCTTTTGACATTGCTTG//SEQ 


ID 


NO: 


4031 


PROST 20110120// G AT AAAT T C AG C AAG AG CAT / / SEQ 


ID 


NO: 


4032 


SKMUS 2 0091900/ / AAC TCTGCACTCCATAACTG/ / SEQ 


ID 


NO: 


4033 


SMI NT 2 0 02 4 1 4 0 / / AAGCCTCTAAAAGTC AACAC/ / SEQ 


ID 


NO: 


4034 


SMINT2 0 092160/ / TTAAACAAGTGAGCCTC AGA/ / SEQ 


ID 


NO: 


4035 


SPLEN2 0 04 07 8 0 / /TTTCCTGTTTGGTTAGTTTT/ /SEQ 


ID 


NO: 


4036 


SPLEN2 0 110860/ / CTGACGGAAAACTTCTAATT/ / SEQ 


ID 


NO: 


4037 


SPLEN201 774 00/ /ATATCTGGTTGTTGGGTTTT//SEQ 


ID 


NO: 


4038 


TESTI20038240//GTCTGTCTTGATGGATTGGA//SEQ 


ID 


NO: 


4039 
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TESTI 20 0 4 ^1 ^0 / /AArTATCAfiArTKrAAfiAGC/ / ^EO 


ID 


NO; 


4 f)4 fi 
i u i \j 


lJLOii^W^O'J'il/ / / 131? X AVjL,k»AA 1 AVjL»AAAV«,AVj\j/ / dCiy 


Tn 




a n a i 

u ft 1 


lLOx / J / U / / ALU 1 1 uLAlnnx twl \^AVj IL/ / 


Tn 


KTO . 


A n A o 

ftUffciC 


IL^li^UUJ/iUU / / AVj 1 L.v.-UAvj I L. 1 Ulnu l I Loo/ / OL^ 


1 JJ 


vto . 


A C\A "5 
ft U ft ,3 


xLollZUUD / DyU / / ALAl 1 1 IbblAl 1 (aA^AU 11// ohy 


ID 


MO . 


A fl A A 
ft Ufi fi 


ILoll^Ul 1 jy fl U / /CjTCAGTCCALLI IALiLTI 1 / / bfcQ 


ID 


NO: 


A f\ A C 

4045 


Ihbn^Ul^sooU/ / CAAACGATTACSACACAAAA/ / St Q 


ID 


NO: 


/I rk vl c 
4 04 6 


TEbTI^U lDloOU//CGTTCCTCAGGTAGCAAGAT/ / SEQ 


ID 


NO: 


A t\ A "7 

4 047 


TESTI 20 1/ 30 50 / / ACATGGCTGAAGGTGATTTT//SEQ 


ID 


NO: 


404 8 


TESTI 2 0 1 9 8 6 0 0 / / TTTAGAAACATTGGCATCAG/ /SEQ 


ID 


NO: 


4049 


TESTI202579 10/ /CTGCCTAGAGTAGAACAAAA//SEQ 


ID 


NO: 


4050 


TE ST 120262940 / /CTCCCAATCTCAAACACAAG/ /SEQ 


ID 


NO: 


4051 


THYMU2 004 6770/ /CTTCTGCCGAGTTTGTGTAA//SEQ 


ID 


NO: 


4052 


THYMU20 0585 50 / /GATGCTGAGAAGGTGTTAGT/ /SEQ 


ID 


NO: 


4053 


THYMU20062520/ /AGTCTCAGGATGGGTAAAGG//SEQ 


ID 


NO: 


4054 


TH YMU2 0 0627 70/ / AGAGTTAAGAACCGAGGGAT/ / SEQ 


ID 


NO: 


4055 


TH YMU2 0078240// CAAGCCAGGG AG ATAGACAT/ / SEQ 


ID 


NO: 


4056 


THYMU2 0 150190 / /TACTACAATGTGGGCTACGG/ /SEQ 


ID 


NO: 


4057 


TRACH 20125620// CCACATTGTAAAC AGTCCTT/ /SEQ 


ID 


NO: 


4058 


TRACH2 0 149740/ / AGATACATTTTCCGTCAAGC7 /SEQ 


ID 


NO: 


4059 


TRACH 20190460/ /CC AG AAGAGTAGC AAG AATT/ / SEQ 


ID 


NO: 


4060 


UTERU2 0 045200/ / ATTCAACTAAAACAAAGCTG/ / SEQ 


ID 


NO: 


4061 


UTERU2 0 064120/ / ACCCAG AAAAGAG ATGAGAA/ / SEQ 


ID 


NO: 


4062 


UTERU2 0103200// CTGTTCCTGGCAAATAAGAG/ / SEQ 


ID 


NO: 


4063 


ADRGL2 0046760/ /ATGTGAAGGAATGATGTACT// SEQ 


ID 


NO: 


4064 


ASTR02 0055530/ /GAATAATGAAGGGGACCAGA/ /SEQ 


ID 


NO: 


4065 


BRAMY2 0076130/ / CCTTTCATGTCTCAGTATTT// SEQ 


ID 


NO: 


4066 


CT0NG2 0170940/ / ATACGTCAGAGGACACATGC/ / SEQ 


ID 


NO: 


4067 


FCBBF2 0033360/ / TCCGT AGC AGTAGAAACATC/ / SEQ 


ID 


NO: 


4068 


FCBBF3 0257370/ /CAGGTATGCTTGGAGTTTCA//SEQ 


ID 


NO: 


4069 


FCBB F50001650/ / CGTGAT TAGGAAGGG ACAGT/ / SEQ 


ID 


NO: 


4070 


FEBRA2 0 0 4 029 0 / /CCCAAGAACGAAACAAAACT// SEQ 


ID 


NO: 


4071 


FEBRA2 0063720/ / AAT AATGCCCACCTATAAAA/ / SEQ 


ID 


NO: 


4072 


FEBRA2 0098040/ /GAGGGGAATTGTC AGTACAC/ /SEQ 


ID 


NO: 


4073 


FEB RA2 0108580/ /TCATTTTGTCTAGTGCCCAT/ / SEQ 


ID 


NO: 


4074 


MESAN2 0021860 / / ACACATTCCCATCAATAGGT/ / SEQ 


ID 


NO: 


4075 



128 



EP 1 308 459 A2 



MP 9 AN? HO 74 "3 n / / A^rTAA^fiAGRTTTTCACAT / / SEO 


ID 


NO - 


4076 


NT9MF9fin4 SI Qfi / /nftAAT ATRTT^RfTA^TTA / /qyn 

IMl^nb^UvUl^U/ / UOrtn 1 1 w X XVJOwVl AU X X t\f f ^CiU 


TD 


NO ' 


4077 


oDACTonm c*7 £ n / /aarTTraTrraTTrrAarTn / / cpo 

rKUo IZUUIO f OKJ / / J\r\\^ x x Lnl Lvnl 1 l^^/i/iv^X O/ / Jby 


t n 

X u 


MH » 

IN \J . 






t n 


MA - 


4 n 7 q 

ill / " 


oMJ.lv 1 zUUUbUzU/ / LI i bul Ibi LttLl 1 1 Ul/iL?/ / o£<W 


x u 




4 n ft ft 


iLDilZUU / U/ / v3<jtU x VarlU ill ill LHALno/ / OCjV 


Tn 


MA . 


dn ft i 


TEST 120 103690 / / CCACTTTATTTCTCCiLCC 1 / / or*Q 


t rs 


InU : 




TESTI202 544 80//GTGGACACAACTTGCTTTAC/ /SEQ 


ID 


NO: 


4083 


THYMU100 042 80//GAGAGTCTGCCTAGCTGTGT//SEQ 


ID 


NO: 


4084 


THYMU 20030460 / / GCCCATGTGAGTAGGTGTAG / / SEQ 


ID 


NO: 


4085 


TRACH2 009 0060/ /AGGGCCAACTTAAATCTCTG/ / SEQ 


ID 


NO: 


4086 


UTERU2 0 0 41970 / /GATAAACCCCAAACATGAAA/ / SEQ 


ID 


NO: 


4087 


BRAMY2 0125360// GACAGACTAAAACGTTGAGC / / SEQ 


ID 


NO: 


4088 


OCBB F2 0142290// CC AAGG AG AGTC AGTG ACAG / / SEQ 


ID 


NO: 


4089 


SKMUS200067 90//TTCAAAAGCAGAGACTCCCT//SEQ 


ID 


NO: 


4090 


TESTI20030610// TG AACTCAGTACCAGGCTTG/ /SEQ 


ID 


NO: 


4091 


UTERU2 0026620 / /GAGATTCCCTAGTGGTGGTT/ / SEQ 


ID 


NO: 


4092 



[0305] The result of expression frequency analysis is shown in Table 40. The clones not shown in the table contain 
30 clones whose expression levels could not be measured because the levels were too low or the sizes of the PCR 
products were different from the expected. It was confirmed that the expression levels of IL-8 gene used as a positive 
control gene were elevated. 

[0306] The result obtained by the search for the genes whose expression levels were altered depending on the 
presence of TNF-cc in culturing THP-1 cell, which is a human monocyte cell line, showed that the clones whose ex- 
35 pression levels were elevated by twofold or more one or three hours after the stimulation (the clones whose expression 
levels were 0.1 or lower both before and after the stimulation were excluded), were 

ASTRO20055530, ASTRO20055930, ASTRO20088950, BRAMY20027390, BRAMY20076130, BRAMY2011841 0, 
BRAMY201 25360, BRAMY202371 90, BRCAN20001480, BRHIP1 0000720, CD34C20001750, CTONG20078340, 
CTONG20085210, DFNES20063460, FCBBF20029280, FCBBF20033360, FCBBF30078600, FEBRA20007820, 

40 FEBRA20031280, FEBRA20031810, FEBRA20040290, HLUNG20041540, HLUNG20092530, MESAN20021860, 
MESAN20067430, MESAN20084150, NT2NE20092950, NT2RP70031 070, OCBBF2001 2520, OCBBF201 42290, 
OCBBF201 65900, OCBBF201 70350, OCBBF201 76650, PLACE60006300, PROST20011160, PROST201 06060, 
SPLEN20040780, SPLEN201 1 0860, SPLEN201 77400, TESTI2003061 0, TESTI200431 30, TESTI20059370, 
TESTI20254480, THYMU 10004280, THYMU 20030460, THYMU20062520, THYMU20078240, THYMU20150190, 

45 TRACH20090060, TRACH20 125620, UTERU20026620, UTERU20045200, UTERU200641 20, UTERU20 103200. 
[0307] On the other hand, in particular cases where the expression levels were relatively high in the unstimulated 
cells (the relative value was 1 or higher), the clones whose expression levels were decreased by twofold or more by 
the TNF-ot stimulation (the clones whose expression levels were increased 1 or 3 hours after the stimulation were 
excluded) were 

50 BNGH420052350, BRACE20052530, BRAMY20003880, CTONG201 70940, FCBBF30022680, FCBBF30225930, 
FCBBF30257370, FEBRA20046280, KIDNE20084030, KIDNE201 88630, NT2NE20082130, OCBBF2011 021 0, 
PLACE60061370, PROST20041460, PROST20075280, PROST2011 0120, SMINT20006020, TESTI20046540, 
TESTI20057200, TESTI201 13940, TESTI2025791 0, TESTI20262940, TRACH201 49740. 
[0308] These clones were thus revealed to be involved in the inflammation reaction induced by TNF-cc. 

55 [0309] The result obtained by the search for the genes whose expression levels were altered depending on co- 
culturing gastric cancer cell line MKN45 with cag PA I positive Helicobacter pylori (TN2) , showed that the clones whose 
expression levels were elevated by twofold or more (the clones whose expression levels were 0.1 or lower both before 
and after the stimulation were excluded), were 
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BRAMY20028530, BRAMY20035380, OCBBF20 170350, PROST20011160, SKMUS20091900, SPLEN20040780, 
THYMU20078240, TRACH201 90460, UTERU20045200, UTERU20064120, ASTR020055530, CTONG201 70940, 
FEBRA20040290, MESAN20067430, PROST20016760, THYMU10004280, TRACH20090060, UTERU20041970, 
OCBBF201 42290, TESTI20030610. 

5 [0310] Of these clones, the expression levels of BRAMY20035380, SKMUS20091900, SPLEN20040780, 
UTERU20064120, CTONG201 70940, OCBBF20 142290, TESTI20030610 were not increased by the co-culture with 
the cagE mutant (TN2AcagE). There may be the possibility that the expression levels of the 7 clones are altered via 
the NF-kB pathway. Among them, the expression levels of OCBBF201 42290, SPLEN20040780, TESTI20030610, 
UTERU20064120 were also increased when human monocyte cell line THP-1 was stimulated with TNF-a. 

10 [0311] On the other hand, in particular cases where the expression levels were relatively high in the unstimulated 
cells (the relative value was 1 or higher), the clones whose expression levels were decreased by twofold or more in 
the presence of Helicobacter pylori were 

ASTRO20088950, BRACE20052530, BRAMY20003880, BRAMY20027390, BRAMY20036530, BRAMY201 1 841 0, 
BRHIP20000210, FCBBF20032930, FCBBF30022680, FCBBF301 69870, FEBRA201 82030, KIDNE201 82540, 
15 LIVER20007750, MESAN20021220, NT2NE2005921 0, NT2NE20082130, OCBBF201 55030, PROST20065100, 
PROST20075280, SPLEN20110860, TESTI20057200, TESTI20113940, TESTI201 49880, TESTI20151 800, 
TEST1201 98600, TESTI2025791 0, THYMU20046770, THYMU20058550, THYMU201501 90, FCBBF20033360, 
FCBBF30257370, FEBRA20098040, SMI NT2 0006020. 
[0312] These clones are involved in gastritis or gastroduodenal ulcer. 

20 



Table 3 



25 



30 



35 



40 



45 



50 



Clone ID 


CD34C 


D30ST 


D60ST 


D90ST 


ASTRO200 10290 


0 


55.437 


0 


0 


BRAMY20036530 


0 


33.144 


0 


0 


BRAMY20043630 


0 


0 


0 


13.575 


BRAMY20089770 


0 


0 


0 


63.803 


BRAMY201 90550 


0 


0 


0 


31.089 


CD34C20001 750 


100 


0 


0 


0 


FCBBF20066340 


0 


0 


0 


76.503 


FEBRA20040290 


0 


0 


0 


14.912 


HLUNG20015180 


0 


0 


0 


8.491 


HLUNG20041590 


0 


0 


0 


1 8.349 


HLUNG20052300 


0 


36.241 


0 


0 


KIDNE20084040 


0 


0 


0 


65.916 


MESAN20021860 


0 


0 


0 


56.046 


MESAN20027240 


0 


0 


0 


33.731 


NTONG20055200 


0 


0 


0 


19.731 


PROST200 16760 


0 


0 


0 


15.442 


PUAEN1 0001 640 


0 


34.074 


0 


0 


SMINT20006020 


0 


7.702 


0 


9.021 


SMINT20028840 


0 


68.605 


0 


0 


SMINT20035050 


0 


9.417 


0 


11.029 


SPLEN20181570 


81.506 


0 


0 


0 


TESTI20064530 


0 


0 


77.29 


0 


TESTI20210030 


0 


0 


0 


90.471 


THYMU20029830 


0 


42.091 


0 


0 


THYMU20139160 


0 


0 


0 


3.486 


TRACH20051590 


0 


0 


0 


75.21 7 


Table 4 


Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


ADRGL20023920 


0 


6.69 


0 


0 
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Table 4 (continued) 



5 



w 



15 



20 



25 



30 



35 



45 



55 



Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


ASTR020009140 


0 


o 


13.389 


o 


BNGH420077980 


0 


0 


20.754 


0 


BNGH420086030 


0 


0 


0 


13.322 


BRACE20062580 


0 


0 


3.585 


2.717 


BRACE20079370 


o 


0 


9.312 


14.111 


BRACE20215410 


0 


o 


27.1 96 


o 


BRAMY20003540 


o 


o 


3.208 


4.861 


BRAMY20043630 


o 


o 


2.408 


0 


BRAM Y200761 30 


o 


o 


6.601 


o 


BRAMY20095080 


0 


o 


25.606 


0 


BRAMY20227860 


o 


2.798 


6.1 7 


3.596 


BRAWH20082550 


o 


o 


o 


9.738 


BRHIP1 0001 040 

i— > i ii ill 1 \J\J\J 1 \J"ryj 


o 


o 


o 


2.323 


BRSSN2000561 0 


o 


22.393 


o 


o 


CTONG20027660 


o 


o 


50.642 


o 


CTONG20044230 


o 


o 


50.642 


o 


CTONG200661 1 0 


o 


o 


19.1 31 


o 


CTONG2007QSQ0 


o 


1 4.297 


29.1 02 


o 


CTOIMG900 84660 


o 


o 


2.553 


0 




o 


8.861 


o 


9.111 




Q 


9.056 


o 


o 


CTONG201 88080 


o 


10.31 9 


o 


o 


FCBBF20023490 


o 


7.376 


o 


7.583 


FCBBF20033360 


o 


o 


5 01 5 


o 


FCBBF20059660 


o 


0 


0 


66.235 


FCBBF20070Q50 


o 


o 


15.367 


o 


FCBBF30004340 


o 


o 


0 


8.778 


FCBBF300Q541 0 


o 


79 235 


o 


o 


FORRF^OI 2^460 


o 


o 


o 


9.321 


FCRRF^OI 7Q1 80 


o 


o 


56.41 8 


o 


FCRRF ^09^66 70 


o 


1 .325 


4.494 


2.724 


FCRRF r *09S7'370 


o 


19.522 


o 


o 


FrRRFmirifinfii n 


n 
yj 




R6 41 R 

OO.H I O 




rODDrOUUU 1 DOU 


n 






99 1 R1 

d-d. . I O I 


rCDnntUUOOOOU 




16 194 




n 


FFRRA9nn'3Q960 




7 1 69 


9 A1 


n 


PFRRA9finR^79n 
rCDnrt/iUUOJ / £\) 


u 




n 
u 


n 
u 


FEBRA20090220 


0 


4.602 


3.122 


4.732 


FEBRA201 50420 


0 


0 


33.26 


0 


HEART1 0001 490 


0 


0 


9.27 


0 


HLUNG20032460 


0 


0 


0 


21 .278 


HLUNG20041590 


0 


0 


0 


4.932 


KIANE20089870 


0 


0 


0 


4.145 


MESAN20016270 


0 


0 


39.208 


0 


MESAN20021860 


0 


0 


0 


7.532 


MESAN20060430 


0 


0 


24.385 


0 


MESAN20067430 


0 


0 


0 


9.568 


NT2NE200 18740 


0 


0 


0 


100 


NT2NE20018890 


0 


0 


0 


100 


NT2NE20021860 


0 


0 


0 


100 
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Table 4 (continued) 



5 



10 



15 



20 



25 



30 



35 



45 



55 



Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


NT2N E20026200 


o 


0 


0.808 


2.449* 


NT2N E2002651 0 


o 


o 


o 


100 


NT2N E20028700 


o 


o 


o 


22.223 


NT2N E20033 1 50 

1 Iff 1 C— 1 ti Www 1 ww 


o 


o 


o 


50.133 


NT2N F°00370n0 


o 


o 


o 


1 00 


NT2N F2003RR70 


o 


o 


o 


1 00 


NTPN FP003Q21 0 


o 


o 


o 


1 00 




o 


o 


Q 


rq nft*3 


NTPMP°ft04R1 QO 


o 


0 


5.05 


1 R **flR 

I OLIO 


NT2NI F20047R70 


o 


o 


o 


100 


ntpm poaacoooa 


o 


o 


A 

u 


mo 

1 LILI 


mtom pooor^qro 


u 


Of) acq 


A 
U 


OA RAA 


IN I ^INC^UUOa^ I U 


a 
u 


A 
u 


A 
U 


i no 
1 uu 


ntom fooororao 


u 


a 
u 


A 
U 


1 oo 

i ULI 


IN I cINCcUUDU/OU 


n 
u 


n 
u 


A 
LI 


1 no 
I uu 


IN I cINCcUUD I UOU 


u 


1 ft 

%3i7. I o 


A 
U 




MTOM P9nnR9Pftfl 


u 


u 


A 

U 


i OO 
1 UU 


MTOMPOOOR47RO 


u 


u 


A 

u 


1 OO 
I UU 


MTOMPOOORRRQO 




u 


A 

u 


1 OO 
I UU 


mtompooorqrro 

IN I C.IN ciuuoyoou 


U 


u 


A 

u 


CQ ADO 
J3 .UC50 


MTOMFoon70Roo 


a 
u 


u 


A 

u 


1 OO 
I UU 


IN I <!INC_tUU/ OODU 


u 


u 


A 
U 


1 oo 
I uu 


mtomfooo770ro 


u 


a 
u 


A 
U 


1 oo 
I uu 


MTOMP00077070 


n 
u 


u 


A 
U 


1 oo 
1 uu 


mtom pooo 77 rro 

IN 1 ^IMC^UU / / ODU 


u 


A 

u 


A 
LI 


1 oo 
1 uu 


mtompooo7qr70 

IN 1 ctVC^UU / v7U / U 


a 
u 


u 


A 
LI 


■i oo 
1 uu 


mtom poooro 770 

IN 1 tZi^UUOU / /u 


a 
u 


A 

u 


A 

u 


i OO 

f uu 


mtompoooroi 

IN I ^IN L^vUOd 1 OU 


a 
u 


A 
U 


A 
LI 


OQ AQ 

oo.uy 


mtq m p 9nnno Rftn 

IM 1 ^iM ttUUOtDUU 


a 
U 


A 
U 


A 
LI 




IN 1 *£(\l CiiUUdOU/U 


U 


A 
U 


A 
U 


■1 AA 

1 uu 


IN 1 ^INP^UUDf £/\J 


u 


A 


A 
U 


■! OO 

1 uu 


IN 1 <£IN t^UUO / OOU 


a 
u 


A 

u 


A 
U 


1 uu 


MTOM COrtOQQn'Jn 

in i ^r\j tdUUtJouou 


U 


U 


U 


bU.4^:o 


mtom connooocn 

IN 1 ^if\ t^UUy^HiDU 


U 


OA OQO 


U 


OA Q ~7 


mtom PonnoRo^n 


U 


0 


U 


CC H OQ 

OO. 1 OO 


In 1 i^fN t*iUl U4UUU 


0 


0 


0 


100 


MTOM CorH A"7QH A 

In 1 ^In t*:Ul U/oi U 


0 


0 


0 


100 


NT2NE201 08420 


0 


0 


0 


45.407 


NT2NE20111190 


0 


0 


0 


61.289 


NT2NE20112210 


0 


0 


0 


26.669 


NT2NE20114850 


0 


0 


0 


100 


NT2NE201 17580 


0 


0 


0 


100 


NT2NE201 19980 


0 


0 


0 


100 


NT2NE201 23610 


0 


0 


0 


100 


NT2NE201 24570 


0 


0 


0 


100 


NT2NE201 26030 


0 


0 


0 


100 


NT2NE201 27900 


0 


0 


16.514 


25.025 


NT2NE20140130 


0 


0 


0 


100 


NT2NE201 40280 


0 


0 


0 


100 


NT2NE20141040 


0 


0 


0 


36.013 
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Table 4 (continued) 



Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


NT2N E20 1 45250 


o 


o 


0 


100 


NT2NE20 146510 


o 


0 


0 


100 


NT2NE201 48690 


o 


0 


o 


67.567 


NT2NE201 49500 


0 


0 


0 


100 


NT2NE20150610 


o 


0 


0 


100 


NT2NE201 52620 


o 


o 


o 


100 


NT2NE20 153620 


Q 


o 


o 


60 858 


NT2NE201556S0 


Q 


Q 


o 


1 00 


NT2NF20157120 


o 


o 


o 


1 00 


NT2NE20165190 


o 


o 


o 


100 


NT2NF201 67660 


o 


o 


o 


1 00 


NT2NF201 7^Q70 




o 


o 


1 00 


MT2NF201 7791 0 


ft 


o 


o 


43.734 


WTPNIF201 R1 760 

IN 1 tINLtU 1 O 1 / UU 


Q 


o 


o 


1 00 


WT9NF2ni ri nnn 

IN 1 cINCtv 1 O 1 OVsls 


o 


o 


o 


46.215 


NT2NF9ft1 R479ft 

IN 1 tlN Ctv 1 04/ tU 


ft 


o 


Q 


1 00 


NT9RI9ftft1 R24H 

IN 1 cnlcUU 1 OtHU 


ft 


o 


1 00 


0 


IN I tnl<lUUt l£UU 


o 


ft 


8 844 


ft 


IN 1 cnitUU009tU 


o 


Q 


1 00 


ft 


MT2RipnnQ^m n 


Q 


o 


1 00 


ft 


MT2RP7nnm 1 pn 

in 1 1 n i / vjv/tj i i cu 


ft 


Ot .3 / O 


ft 


ft 


NT9RP70001 71Ci 

IN 1 till / \J\J\J 1 / OU 


ft 




ft 


ft 


NT2RP7000 < *11 0 
in i tnr / uvuo I I u 


Q 


24 333 


ft 


ft 


NT9RP7ftft1 PR^ft 
In I tnr/UU I cOOU 


ft 


ft^Q 


ft 
U 


R 7QR 


NTPRP7ftftPPRpft 

IN I t Pi r / UUiicOtJ 


ft 




ft 
U 


U 


NTPRP7ftftP77Qft 


ft 


ftft 1QA 


ft 
U 


ft 
u 


NTPRP7009Q7R0 

IN 1 till / ULIt cl / OU 


ft 


1 ftft 


ft 


ft 


In l till / UU0U04U 


n 


1 ftft 
I uu 


ft 
u 


ft 

u 


In I till/ W L>0 I U/U 


u 


1 ftft 
I uu 


ft 
u 


ft 

u 


IM I tnr / UL»0 I 0*+U 


ft 
u 


1 ftft 
1 uu 


ft 
u 


u 


MT9RP7nfl^1 zlRft 
IM 1 t rir / UUO I *fOU 


ft 
u 


1 nn 

1 UU 


U 


U 


im i ^rtn / uuoo i i u 


ft 
u 




U 


U 


MT9RP7nn4R41 ft 
IM I t rir / UWD't I U 


ft 
u 


ft/1 QftQ 


U 


U 


MTPRP7ftft4Qft1 ft 
IM I til" / I U 


ft 
u 


1 UU 


r\ 
U 


U 


MT9RP7rtn(;ft9Qn 

IM \ 4LrW I uuoo<iy u 


ft 

u 


•i nn 
1 UU 


U 


n 
U 


MT9PD7nnKRRQn 
IM 1 /LrW^ f UUODDyU 


u 


•i nn 
i UU 


0 


0 


Im 1 z:nr/UUO/OUU 


U 


HOC 

lo.b 


0 


0 


NT2RP70064570 


0 


60.194 


0 


0 


NT2RP70074800 


0 


100 


0 


0 


NT2RP70075300 


0 


6.726 


0 


0 


NT2RP70075800 


0 


100 


0 


0 


NT2RP70080150 


0 


100 


0 


0 


NT2RP70084540 


0 


100 


0 


0 


NT2RP70087140 


0 


7.444 


15.153 


7.654 


NT2RP70090870 


0 


17.122 


11.617 


0 


OCBBF20001780 


0 


0 


15.351 


0 


OCBBF20009820 


84.348 


0 


0 


0 


OGBBF20142290 


0 


0 


0 


14.577 


OCBBF201 55030 


0 


49.439 


0 


0 


OCBBF201 75360 


0 


0 


0 


20.881 
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Table 4 (continued) 



Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


OCBBF201 77540 


0 


25.509 


0 


0 


OCBBF201 77910 


0 


0 


23.505 


0 


PLACE60054820 


0 


0 


20.089 


0 


PLACE60061370 


0 


18.07 


0 


0 


PLACE60073090 


0 


10.617 


0 


0 


PLACE60162100 


0 


0 


0 


37.629 


PROST20011800 


0 


0 


0 


50.542 


PROST20045700 


0 


0 


0 


44.59 


PROST2007871 0 


0 


0 


0 


33.633 


PROST20094000 


0 


0 


18.34 


0 


PUAEN1 0000650 


0 


3.375 


0 


0 


PUAEN1 0001 640 


0 


5.216 


3.539 


0 


SKNMC20006350 


o 


2.005 


2.041 


4.124 


SMINT20016150 


0 


49.972 


0 


0 


SMI NT20030740 


o 


5.712 


o 


0 


SMINT20035510 


0 


0 


28.984 


0 


SM I NT20039050 


o 


o 


18.576 


9.383 


SM I NT20047290 


o 


39.6 


o 


o 


SPLEN20063250 


o 


8.081 


8.225 


8.309 


SPLEN20117580 


o 


0 


0 


24.462 


SPLEN20125230 


o 


0 


21.559 


0 


TESTI20030610 


o 


o 


o 


12.205 


TESTI20043910 


o 


o 


o 


45.963 


TESTI20066280 


o 


0 


0 


30.676 


TEST120067480 


o 


8.861 


o 


o 


TESTI20105130 


o 


o 


0 0 


2.5 


TESTI20106170 


o 


13.104 


o 


o 


TESTI20143180 


o 


0 


62.741 


0 


TESTI20221 790 


o 


o 


39.1 67 


o 


TESTI20254090 


0 


0 


0 


19.063 


TESTI20274960 


0 


0 


38.685 


0 


THYMU 10004280 


0 


3.573 


0 


0 


THYMU20007020 


0 


0 


71.017 


0 


THYMU20 104480 


0 


0 


29.694 


0 


THYMU20139160 


0 


1.822 


0.618 


0 


TRACH20026640 


0 


0 


7.476 


0 


UTERU1 0001 060 


0 


0 


19.967 


0 


UTERU20026620 


0 


10.41 


0 


10.703 


UTERU20079240 


0 


0 


0 


19.734 


UTERU20083020 


0 


7.182 


0 


0 


UTERU201 02260 


0 


0 


0 


23.706 


UTERU201 32620 


0 


48.06 


0 


0 



Table 5 



Clone ID 


BEAST 


TBAES 


CTONG20070780 


0 


97.283 


CTONG20084660 


0 


89.108 


HLUNG20045340 


0 


85.362 
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Table 5 (continued) 



Clone ID 


BEAST 


TBAES 


TESTI20047370 


0 


71.55 



5 



Table 6 



w 



15 



25 



Clone ID 


CERVX 


TCERX 


SMINT20030740 


0 


65.795 


Table 7 


Clone ID 


COLON 


TCOLN 


UTERU2004037 


68.149 


0 


Table 8 


Clone ID 


NESOP 


TESOP 


HLUNG20015180 


51.695 


0 


NESOP20004520 


100 


0 


NESOP20005040 


100 


0 


TESOP1 0000350 


0 


86.127 


TESOP1 0001 600 


0 


100 


THYMU20071120 


0 


81.712 



30 Table 9 



40 



50 



55 



Clone ID 


KIDNE 


TKIDM 


ASTR020009140 


19.518 


0 


ASTR020027330 


0 


30.903 


ASTR020055930 


0 


36.981 


BGG1 12001 0750 


4.532 


0 


BNGH420074600 


0 


11.358 


BRACE20050870 


0 


15.884 


BRACE20054480 


29.719 


0 


BRACE20062580 


2.613 


0 


BRACE20219360 


59.494 


0 


BRAMY20003540 


4.676 


0 


BRAMY20003880 


16.882 


41.527 


BRAMY20043630 


3.51 


0 


BRAMY20055760 


0 


65.196 


BRAMY201 25360 


0 


32.672 


BRAMY201 90550 


0 


19.772 


BRAMY20204270 


3.618 


0 


BRAMY20227860 


0 


4.255 


BRAWH20014590 


0 


23.509 


BRAWH20093070 


0 


14.759 


BRHIP10001040 


0 


5.496 


CTONG20033750 


59.93 


0 


CTONG20039370 


59.93 


0 


CTONG20045500 


59.93 


0 
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Table 9 (continued) 



10 



15 



20 



25 



30 



35 



40 



45 



55 



Clone ID 


KIDNE 


TKIDN 


CTONG20079590 


0 


17.392 


FCBBF20023490 


14.59 


0 


FCBBF30004340 


0 


10.386 


FCBBF301 06950 


0 


69.888 


FCBBF301 15230 


0 


69.888 


FCBBF30 169280 


0 


38.857 


FCBBF30225930 


0 


50.898 


FCBBF30282020 


0 


53.714 


FEBRA20038330 


0 


6.538 


FEBRA20039260 


7.084 


0 


FEBRA20040290 


7.711 


o 


FEBRA20082660 


0 


77.359 


FEBRA20121200 


o 


63.078 


FEBRA201 70240 


0 


63.078 


HEART1 0001490 


20.269 


0 


HLUNG20041590 


4.744 


0 


HLUNG20068120 


11 .84 


o 


HLUNG20072450 


3.599 


o 


HLUNG20083480 


o 


15.004 


HLUNG20083960 


18.946 


o 


KIDNE20011600 


100 


o 


KIDNE20016360 


59.589 


o 


KIDNE20024380 


100 


o 


KIDNE20027980 


100 


o 


KIDNE20080690 


5.861 


o 


KIDNE20081170 


100 


o 


KIDNE20083150 


1 00 


o 


KIDNE20083620 


100 


o 


KIDNE20084030 


40.03 


0 


KIDNE20084040 


34.084 


0 


KIDNE20084730 


1 00 


o 


KIDNE20084800 


1 00 


o 


KIDNE20086490 


87.61 


o 


KIDNE20086660 


47.013 


o 


KIDNE20086Q70 


1 00 


o 


KIDNF20087880 


28.683 


o 


KIDNF20088240 


1 00 


o 


KIDNE20089870 


3.987 


0 


KIDNE20091090 


100 


0 


KIDNE20094260 


100 


0 


KIDNE20094670 


59.494 


0 


KIDNE20095530 


100 


0 


KIDNE20133460 


100 


0 


KIDNE20133880 


100 


0 


KIDNE20134130 


65,363 


0 


KIDNE20134890 


100 


0 


KIDNE20137310 


100 


0 


KIDNE201 38450 


38.971 


0 


KIDNE20140870 


22.93 


0 


KIDNE20141120 


100 


0 
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Table 9 (continued) 



Clone ID 


KIDNE 


TKIDN 


KIDNE20141700 


100 


0 


KIDNE20142680 


100 


0 


KIDNE20142680 


100 


0 


KIDNE201 42900 


31.732 


0 


KIDNE20143200 


100 


0 


KIDNE20147170 


100 


0 


KIDNE20148080 


100 


0 


KIDNE20149780 


60.365 


0 


KIDNE201 50730 


100 


o 


KIDNE20152440 


100 


0 


KIDNE20154330 


100 


0 


KIDNE20154830 


100 


0 


K1DNE20155980 


100 


o 


KIDNE20157100 


100 


0 


KIDNE20160360 


1 00 


0 


KIDNE20160960 


1 00 


0 


KIDNE20163710 


1 00 


0 


KIDNE20165390 


100 


o 


KIDNE201 691 80 


1 00 


0 


KIDNE201 70400 


19.556 


o 


KIDNE20173150 


1 00 


o 


KIDNE201 73430 

1 X 1 I— / l >i 1— £— \J 1 / W*TWW 


36.673 


o 


KIDNE201 76030 


100 


o 


KIDNE20181670 


100 


o 


KIDNF201 82540 


1 00 


o 


KIDNE20186170 


100 


o 


KIDNE201 88630 

INI 1— / 1 N I — C— \J 1 UL"JU w 


100 


o 


KIDNE201 89890 

(ML/I X^Cv tUCWCv 


1 00 


o 


KIDNE201 89960 


100 


o 


KIDNE20191870 


100 


o 


M FSAN20038520 


o 


1 1 .358 


MFSAN20041 380 


o 


53.625 


OCRRF2001 6390 


o 


23.211 


OCBBF201 42290 


o 


8.623 


OCRRF201 74890 


3? ?41 




PI APFR0061370 




OA QR1 


pi APFRnny^nQn 


1 0 501 




PLACE60181870 


49.921 


0 


PROST20016760 


3.992 


9.82 


PUAEN 10000650 


6.676 


0 


SMINT20039050 


0 


11.102 


SMINT20089210 


0 


12.885 


SPLEN20017610 


42.429 


0 


SPLEN20024930 


0 


22.541 


SPLEN20057830 


0 


35 


SPLEN20063250 


3.996 


0 


SPLEN20126110 


50.05 


0 


SPLEN20135030 


31.695 


0 


SPLEN20136700 


0 


39.807 


TESTI20070740 


0 


75.118 



1 



37 



EP 1 308 459 A2 



Table 9 (continued) 



Clone ID 


KIDNE 


TKIDN 


TESTI20262150 


40.96 


0 


THYMU20009500 


0 


24.928 


THYMU200 19260 


0 


36.869 


THYMU201 57620 


0 


19.973 


TKIDN1 0000620 


0 


100 


TKIDN10001710 


0 


100 


TKIDN1 0001 920 


0 


36.869 


TRACH20011010 


10.647 


0 


UMVEN10001380 


0 


4.196 



Table 10 



Clone ID 


LIVER 


TLIVE 


CTONG20069320 


20.862 


0 


FCBBF30236670 


3.257 


0 


FEBRA20038220 


77.547 


0 


FEBRA20039260 


8.806 


0 


K1DNE20087880 


71.317 


0 


LIVER20006260 


100 


0 


LIVER20007690 


85.922 


0 


LIVER20007750 


100 


0 


LIVER20010510 


100 


0 


LIVER20010760 


100 


0 


LIVE R2001 0990 


100 


0 


LIVE R2001 1640 


100 


0 


LIVER20013890 


100 


0 


LIVER20026440 


100 


0 


LIVER20030650 


100 


0 


LIVER20032340 


100 


0 


LIVER20038000 


100 


0 


LIVER20040740 


100 


0 


LIVER20055270 


100 


0 


MESAN20027240 


21.684 


0 


NT2RI20021200 


16.028 


0 


SKMUS20006790 


6.063 


0 


TESTI20035330 


22.045 


0 


THYMU10004280 


8.787 


4.346 


THYMU20029830 


31.692 


0 



Table 11 



Clone ID 


HLUNG 


TLUNG 


HLUNG20052300 


23.611 


0 


SMINT20035050 


6.135 


0 


HLUNG20041590 


10.207 


0 


PROST20016760 


8.589 


0 


BRAMY20043630 


15.102 


0 


HLUNG20015180 


4.723 


0 


THYMU20139160 


1.939 


0 
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Table 11 (continued) 



5 



10 



15 



20 



25 



30 



35 



45 



55 



Clone ID 


HLUNG 


TLUNG 


HLUNG20020850 


67.488 


0 


HLUNG20032460 


44.037 


0 


BRAMY20204270 


7.785 


0 


BRAMY20001510 


5.948 


0 


BRAMY20227860 


1.488 


0 


CTONG20029030 


28.504 


0 


CTONG201 68460 


76.291 


0 


CTONG201 86290 


61.67 


0 


FEBRA20039260 


7.62 


0 


FEBRA20078800 


33.686 


0 


FEBRA201 63980 


38.327 


0 


HCHON20000870 


23.288 


0 


HLUNG20008460 


67.54 


0 


HLUNG20009260 


100 


0 


HLUNG20009550 


100 


0 


HLUNG20010130 


100 


0 


HLUNG20011260 


100 


0 


HLUNG20011440 


100 


0 


HLUNG20011460 


76.577 


0 


HLUNG20012140 


100 


o 


HLUNG20014590 


36.045 


0 


HLUNG20015070 


17.804 


o 


HLUNG20020500 


100 


o 


HLUNG20021450 


68.006 


o 


HLUNG20023030 


100 


o 


HLUNG20024050 


100 


o 


HLUNG20025620 


100 


o 


HLUNG20028110 


76.618 


o 


HLUNG20029420 


100 


o 


HLUNG20029490 


81 .173 


o 


HLUNG20030420 


100 


o 


HLUNG20030490 


100 


o 


HLUNG2003061 0 


100 


o 


HLUNG20031 620 


80.237 


o 


HLUNG20033060 


36.529 


o 


HLUNG2003331 0 


100 


o 


HLUNG20033350 


100 


o 


HLUNG20034970 


79.349 


0 


HLUNG20037140 


100 


0 


HLUNG20037160 


100 


0 


HLUNG20037780 


44.761 


0 


HLUNG20038330 


100 


0 


HLUNG20041540 


100 


0 


HLUNG20042730 


100 


0 


HLUNG20045340 


7.67 


0 


HLUNG20047070 


100 


0 


HLUNG20050760 


100 


0 


HLUNG20051330 


100 


0 


HLUNG20054790 


100 


0 


HLUNG20055240 


100 


0 
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Table 11 


(continued) 




Clone ID 


HLUNG 


TLUNG 


HLUNG20056560 


75.961 


0 


HLUNG20057380 


100 


0 


HLUNG20059240 


100 


0 


HLUNG20060670 


100 


0 


HLUNG20063700 


100 


0 


HLUNG20065700 


62.8 


0 


HLUNG20065990 


100 


0 


HLUNG20067810 


100 


0 


HLUNG20068120 


50.947 


o 


HLUNG20069350 


100 


0 


HLUNG20070410 


100 


0 


HLUNG20072100 


54.241 


0 


HLUNG200721 90 


79.349 


o 


HLUNG20072450 


7.744 


0 


HLUNG20074330 


100 


o 


HLUNG20079310 


100 


0 


HLUNG20081 390 


66.429 


o 


HLUNG20081 530 


100 


o 


HLUNG20082350 


100 


o 


HLUNG20083330 


100 


o 


HLUNG20083480 


13.123 


0 


H LU N G20083840 


100 


o 


HLUNG20083960 


40.76 


o 


HLUNG20084790 


100 


o 


HLUNG20085210 


50.993 


o 


HLUNG20088750 


100 


o 


HLUNG20092530 


100 


o 


H LU N G20093030 


100 


o 


HLUNG200941 30 


75.987 


o 


KIDNE201 42900 


68.268 


o 


PROST20052850 


57.701 


0 


SKNMC20006350 


2.134 


0 


SPLEN20012450 


25.695 


0 


TESTI20057590 


17.804 


0 


TESTI20061200 


29.123 


0 


TESTI20067480 


18.856 


0 


TESTI20116050 


30.168 


0 


THYMU1 0004280 


7.603 


0 


THYMU20010180 


79.349 


0 


TRACH20011010 


22.907 


0 


UTERU20016580 


43.64 


0 


UTERU201 27030 


66.318 


0 



Table 12 



Clone ID 


NOVAR 


TOVAR 


KIDNE20089870 


91.868 


0 


NT2RP70075300 


76.633 


0 


TESTI201 32310 


94.177 


0 
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Table 13 



Clone ID 


STOMA 


TSTOM 


BNGH420087430 


0 


91.629 


BRAMY20227860 


1.35 


0 


BRAWH20027250 


42.096 


0 


CTONG201 74440 


26.346 


0 


FEBRA20090220 


4.442 


0 


PUAEN1 0000650 


13.031 


0 


SMINT20023110 


72.094 


0 


SMINT20030740 


5.514 


0 


SMINT20045890 


34.092 


0 


SPLEN20048800 


2.011 


0 


SPLEN20139360 


79.641 


0 


TESTI20063410 


28.273 


0 


TESTI20 150920 


33.158 


0 


TRACH20026640 


21.272 


0 


UTERU20041970 


0 


72.886 



Table 14 



Clone ID 


UTERU 


TUTER 


ADRGL20020290 


21.538 


0 


BRACE20038920 


10.185 


0 


BRAMY20091230 


39.224 


0 


BRAMY20093490 


62.465 


0 


BRAMY20227860 


2.268 


0 


BRHIP20005060 


61.644 


0 


CTONG20069320 


18.336 


0 


CTONG20083430 


62.039 


0 


FCBBF30005360 


41.409 


0 


FCBBF30257370 


21.099 


0 


FEBRA20038330 


5.808 


0 


FEBRA20039260 


7.74 


0 


FEBRA20040260 


43.146 


0 


FEBRA20078180 


31.447 


0 


FEBRA20087550 


10.505 


0 


HLUNG20015070 


9.042 


0 


HLUNG20015180 


7.196 


0 


MESAN20007110 


50.674 


0 


MESAN20067430 


20.114 


0 


MESAN20095800 


14.162 


0 


NT2RP70057500 


14.698 


0 


SKMUS20008730 


2.775 


0 


SKNMC20006350 


1.084 


0 


SMINT20035050 


3.116 


0 


SMINT20045890 


19.084 


0 


SPLEN20073880 


30.142 


0 


SPLEN20076470 


17.567 


0 


SPLEN201 18050 


33.583 


0 


TESTI20030610 


12.829 


0 
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Table 14 


(continued) 




Clone ID 


UTERU 


TUTER 


TESTI20035330 


9.688 


0 


TESTI20057590 


9.042 


o 


TESTI20059080 


9.096 


o 


TESTI20105130 


0 


57.51 


THYMU1 0004280 


3.861 


o 


THYMU20139160 


1 .969 


21 .546 


UTERU1 0001 060 


31 .804 


o 


UTFRU1 0001 870 


1 00 


o 


UTFRU20000230 


34.999 


o 


LITE RU20000QS0 


3.88 


o 


UTFRLJ20011 7fi0 


1 00 


Q 


UTF RU2001 3890 


100 


o 


UTERU2001 6580 


22.1 64 


o 


UTF RU20026620 


33.752 


o 


I JTFRt J2nn27^fif) 


1 00 


o 


UTF RU20029930 


72.842 


o 


UTFRLJ20011 ^50 


1 00 


o 


UTF RL J200^^770 


1 00 


o 


UTFRUPnodm ^n 


1 00 


o 


UTFRU?0040' : i70 


R PR4 


u 


UTF RL J20040^Q0 


1 00 


u 


I JTF Rl 1200407^0 




n 

u 


i itfri ipnndifi^n 


1 00 


n 

u 


UTFRU20041 Q70 


6.578 


o 


UTF RU2004S200 


62.039 


o 




1 nn 
i uu 


n 

u 


UTERU20064120 
\j • t nut uuu*+ i tu 


100 


o 


UTF RU2006S470 
' i ll rn»j^.uuo^j*+ / u 


50.58 


o 


UTF RLJ2007Q240 


20.742 


o 


UTF RL J2008^020 


7.762 


o 


i itfri lonnRfi^n 


1 nn 


u 


i itfri is>nnR7n7n 


1 nn 
i uy 


u 


i itfri ionnft7ftRn 


1 nn 

I UU 


n 
u 


i itfri i^nnnQ^nn 


£. f .DOJ 


u 


1 ITFRI l9nnRQQQf» 


I / .OO / 


u 


I itfri i^nnnQf^fi 
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u 
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UTERU20094830 


100 


0 


UTERU20095100 


39.739 


0 


UTERU20099040 


100 


0 


UTERU20099510 


100 


0 


UTERU20101150 


100 


0 


UTERU201 02260 


24.918 


0 


UTERU201 03040 


100 


0 


UTERU201 03200 


51.815 


0 


UTERU201 04310 


100 


0 


UTERU20106510 


100 


0 


UTERU20121140 


100 


0 


UTERU201 22520 


51.33 


0 
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Table 14 


(continued) 




Clone ID 


UTERU 


TUTER 


UTERU201 25810 


100 


0 


UTERU201 27030 


33.682 


0 


UTERU20127150 


62.412 


0 


UTERU201 28560 


36.129 


o 


UTERU201 32620 


51 .94 


o 


UTERU201 34830 


100 


0 


UTERU201 39760 


100 


0 


UTERU20140010 


100 


0 


UTERU201 67570 


100 


0 


UTERU201 68960 


100 


0 


UTERU201 69020 


100 


0 


UTERU201 73030 


100 


0 


UTERU201 76230 


72.842 


0 


UTERU20177150 


100 


0 


UTERU20181270 


100 


0 


UTERU201 85220 


100 


0 


UTERU201 88670 


100 


0 


UTERU201 88840 


40.303 


0 


Table 15 


Clone ID 


NTONG 


CTONG 


ADRGL20023920 


13.989 


0 


BRACE20038920 


0 


24.929 


BRACE20050870 


0 


34.54 


BRACE20061620 


63.015 


0 


BRAMY20036530 


21.217 


0 


BRAMY20076130 


0 


6.434 


BRAMY20204270 


0 


2.419 


BRAMY20267780 


0 


4.633 


BRCAN1 0001 680 


14.71 


0 


CTONG1 0000090 


0 


87.752 


CTONG20000340 


0 


100 


CTONG20002790 


0 


100 


CTONG20004110 


0 


100 


CTONG20004520 


0 


100 


CTONG20007660 


0 


100 


CTONG20008190 


0 


100 


CTONG20008460 


0 


100 


CTONG20015240 


0 


100 


CTONG200 17490 


0 


100 


CTONG20020660 


0 


100 


CTONG20020950 


0 


100 


CTONG20027660 


0 


49.358 


CTONG20029030 


0 


44.292 


CTONG20030280 


0 


100 


CTONG20031150 


0 


100 


CTONG20031890 


0 


62.139 


CTONG20032930 


0 


29.763 
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Table 15 (continued) 
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0 
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20 


CTONG20050490 


0 
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0 
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0 
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0 
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25 
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0 
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0 
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0 


24.814 




CTONG200561 50 


0 
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0 
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30 
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0 


54.423 




CTONG200591 30 


0 


72.445 




CTONG20060040 


0 


72.825 




CTONG20061 290 


0 


100 


35 
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0 
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CTONG20063770 


0 


62.139 
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0 


28.37 




CTO NG20065240 


0 
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40 
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0 
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0 
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0 
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45 
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0 
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0 
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CTONG20070780 


0 


2.717 




CTONG24070910 


0 
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0 


32.374 


50 


CTONG20071680 


0 


49.582 




CTONG20072930 


0 


100 




CTONG20073990 


0 


100 




CTONG20074000 


0 


100 


55 


CTONG20074170 


0 


55.786 


CTONG20074740 


0 


100 




CTONG20076230 


0 


100 




CTONG20076810 


0 


100 
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Table 15 


(continued) 




Clone ID 


NTONG 


CTONG 
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0 
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0 
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0 
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0 
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7.487 


0 
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0 


2.578 
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Table 15 (continued) 



5 



to 



15 



20 



25 



30 



35 



40 



45 



55 



Clone ID 


NTONG 


CTONG 
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0 
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Table 15 


(continued) 




Clone ID 


NTONG 


CTONG 


SPLEN20040780 


o 


20.5 


SPLEN20063250 


0 


8.016 


SPLEN201 81570 


0 


4.523 


SPLEN201 87490 


o 


23.141 


TEST 120047370 


o 


1 .998 


TESTI20057880 


0 


1 5.357 


TEST120064530 


0 


10.734 
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o 


8.166 
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o 


4.825 
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o 
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o 
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THYMU 10004280 


o 
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4.53 
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0 
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0 
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0 
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THYMU20139160 
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0 
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0 
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0 
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0 
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0 
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0 


13.562 
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0 
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UTERU20083020 


15.017 


0 
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Table 16 
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Table 17 
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0 


19.518 


BGGI1 2001 0750 


0 


4.532 


BRACE20054480 


0 


29.719 


BRACE20062580 


0 


2.613 


BRACE202 19360 


0 


59.494 


BRAMY20001510 


68.103 


0 


BRAMY20003540 


0 


4.676 


BRAMY20003880 


0 


16.882 


BRAMY20043630 


0 


3.51 


BRAMY20204270 


0 


3.618 


CTONG20033750 


0 


59.93 


CTONG20039370 


0 


59.93 


CTONG20045500 


0 


59.93 


FCBBF20023490 


0 


14.59 


FEBRA20039260 


0 


7.084 


FEBRA20040290 


0 


7.711 
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Table 38 


(continued) 




Clone ID 


FEKID 


KIDNE 


ucA RT1 0001 4Q0 I 


n 


20.269 




o 


4.744 




n 
yj 


11 .84 


hi i iMf^onn7P/iRn 


RR R^7 




HI 1 tMf^PnnR^QRn 
nLU INClcUUOOc70U 


n 
u 


"1 R QAR 


ki nw P^nm 1 finn 


n 




KinMPonoi R*3 

r\IUI>J C^UU 1 OJ 


n 


CiQ RRQ 




n 
u 


1 nn 


r\lulMCcUU£/ you 




1 nn 


r\l U) N CI^lUUOUOcJU 




O.OD I 


l\l UN t*sUUo I l /u 


u 


•i nn 
I UU 


i/inM ponnn^i *^n 


n 
u 


1 nn 

l UU 


t\l UN t^UUooQ^iU 


u 


1 nn 
1 uu 


l\l UN t/iUUo4UoU 


n 
u 


/in n*3 


l\l UN t*lUUo4U4U 


n 

u 


o4.Ut>4 


1 p\M Conn Q.A 7Qf» 
l\IUN t2UUo4/oU 


n 

u 


i nn 
1 UU 


i^irMMConnQ/iQnn 
X\\ UN t2UUo4oUU 


n 

0 


h nn 
1 UU 


MUN t2UUot>4yU 


n 

u 


07 


rMUN t^UUODDOU 


n 
U 


a ~r n H Q 


imun t^uuooy / u 


u 


1 nn 
1 uu 


U'lP^M connovoan 


n 
u 


/lO.boo 


f\l UN t*iUUOO<i4U 


n 
U 


i nn 
1 UU 


i/ir\M COnnOQD7A 

i\iun t2uuoyo/u 


n 
U 


o.yt>/ 


r\iUN t^uuy i uyu 


n 
U 


i nn 
1 UU 


l\l DIM fc:2UUy42oU 


n 
U 


-< nn 
1 UU 


r\ i u n t ^ u u y 4o / u 


n 
U 


CO AQA 

oy.4y4 


m un t/iuuyooou 


U 


-4 nn 
1 UU 


l\l UN t^UlOo4DU 


n 
U 


m nn 

1 UU 


l\IUNb2U 1 JoooU 


n 
0 


1 UU 


YSl p\M conn Oyl-ion 

r\l UN t20 i o4l oO 


n 
0 


bo.obo 


t\l UN t2Ul o4oyu 


n 
U 


^ nn 
1 UU 


r\iUNt2Uio/o i u 


U 


i nn 
1 UU 


i^ir^M QQ/iKn 
MUN t2U 1 oo40U 


n 
U 


oo.y / 1 


f\l UN b2 0 1 4Uo / U 


0 


22.93 


KIUNb2Q1 41120 


0 


1 00 


1/inMronH a -4 vnn 
K.IUNL20 14 1 /OU 


0 


1 00 


l/ir\M r~on H yiocort 

l\IUNb20 i42oo0 


n 
0 


h nn 
1 00 


a oonn 
Kl UN b20 142900 


0 


31 .7o2 


KIDNE201 43200 


o 


100 


KIDNE20147170 


0 


100 


KIDNE20148080 


0 


100 


KIDNE20149780 


0 


60.365 


KIDNE201 50730 


0 


100 


KIDNE20152440 


0 


100 


KIDNE201 54330 


0 


100 


KIDNE201 54830 


0 


100 


KIDNE20155980 


0 


100 


KIDNE20157100 


0 


100 


KIDNE20160360 


0 


100 


KIDNE201 60960 


0 


100 
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Table 38 


(continued) 






Clone ID 


FEKID 


KIDNE 




KIDNE201 63710 


o 


1 00 


5 


KIDNF201 65390 


o 


100 




KIDNE20169180 


0 


100 




KIDNF20 170400 


o 


19.556 




KIDNF201731 ^0 


o 


100 




KIDNF20 173430 


o 


36.673 


10 


KIDNF201 76030 


o 


1 00 




KIDNF201R1670 


o 


1 00 




KIDIMF201 82540 


o 


100 




KiriWF2ni R6170 


o 


1 00 


15 


F\l U\ N C<L\J I OODOU 


o 


1 00 




KIDNF201 RQflQO 


o 


1 00 




k'mMPom RQQRfi 




1 on 




r\t uin Cilkj i t7 lo/u 


n 


1 oo 




OPRRFPfll 74ftQO 


o 


32.241 


20 


pi APFRnn7^nQn 


o 


1 0.501 




rLMUCDU lolD/U 


A 
\J 


4Q QS>i 




PROST20016760 


0 


3.992 




PUAEN1 0000650 


0 


6.676 


25 


SKNMC20006350 


24.429 


0 




SPLEN20017610 


0 


42.429 




SPLEN20063250 


0 


3.996 




SPLEN20126110 


0 


50.05 




SPLEN201 35030 


0 


31.695 


30 


TESTI20061200 


0 


13.537 




TESTI20262150 


0 


40.96 




THYMU1 0004280 


0 


3.534 




THYMU20139160 


0 


2.704 


35 


TRACH20011010 


0 


10.647 



Table 39 



50 



Clone ID 


FELNG 


HLUNG 


BRAMY20001510 


0 


5.948 


BRAMY20043630 


0 


15.102 


BRAMY20204270 


0 


7.785 


BRAMY20227860 


0 


1.488 


CTONG20029030 


0 


28.504 


CTONG201 68460 


0 


76.291 


CTONG201 86290 


0 


61.67 


FEBRA20039260 


0 


7.62 


FEBRA20078800 


0 


33.686 


FEBRA201 63980 


0 


38.327 


HCHON20000870 


0 


23.288 


HLUNG20008460 


0 


67.54 


HLUNG20009260 


0 


100 


HLUNG20009550 


0 


100 


HLUNG20010130 


0 


100 


HLUNG20011260 


0 


100 
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Table 39 (continued) 



w 



15 



20 



25 



30 



35 



40 



45 



55 



Clone ID 


FELNG 


HLUNG 


HLUNG20011440 


o 


100 


HLUNG2001 1 460 


o 


76.577 


HLUNG2001 21 40 


o 


100 


HLUNG20014590 


o 


36.045 


HLUNG2001 5070 

1 1 Urn \mf t * X ACh W V 1 WW / V 


o 


1 7.804 


HLUNG20015180 


o 


4 723 


HI UNG20020500 

1 1 l—^^J 1 il \J w£. WW w w 


o 


1 00 


HI UNG20020850 


0 


67.488 


HI I JNIG200214EO 


u 


DO.UUD 




o 


100 


HI UNGPOOPdOSO 


0 


1 00 


hi i tMf^9nn9RRC>n 


u 


1 nn 

I UU 


hi I iMf^c>nni>Ri 1 n 


n 
u 


/ O.O ID 


hi UNG2nn?Q4Pn 


n 


inn 

I UU 


Hi I iMr;onri9Q4Qn 
nLUiM y*f i?u 


u 


Ol. I/O 


hi i iw^onn^n4or» 

n L_ 1 M \JLCi. UUO U4tU 


u 


i nn 

1 UU 


Hi 1 ]Mf^c>nrj9n4Qn 

n l_ \J 1 M V3<£ vJ UO U*+i7 u 


u 


1 nn 

I UU 


ni_i«ji>jvj»^uuouo i u 


u 


1 nn 

I UU 


Ml I iMf^nmi Ron 


u 


Oft 007 


mi I iNrionmo^Rn 

n l_ U I \l U UOt'+D u 


u 




nLUIMU^UUOoUOU 


u 


con 


hi i iMf^nn^T^i n 


n 
U 


1 nn 

1 UU 




n 
u 


1 nn 

I UU 


hi I iNf^nn^ziQTn 


n 
u 


"7Q Q/IO 

/y.o^y 


nLUI\U£UUO/ 1 *+U 


n 
u 


1 nn 

1 UU 


HI MMl^^>nn^7i firt 

n t_ 1 N \Jk£. UUO 1 1 DU 


n 
u 


i nn 

1 UU 


IIUJINVJ^UUO/ /ou 


n 
u 


/Ol 




u 


1 nn 
l UU 


nLUINVja^UU**-! O^l-U 


u 


1 UU 


Hi I iMf^onnAi con 


n 
U 




hi I iMfiQnn4^7Qn 
nnjiwji^uuH-^. /ou 


u 


-t nn 
i UU 


n l u rvi \jk£. uu4oo4U 


U 


7.67 




U 


1 UO 


hi I iMf^o nn^nT«n 
ni-wiNia^uuou / ou 


u 


I UU 


rjLursio^UUOi ooU 


n 
u 


1 00 


rl LU IN La^c UUO <ioU U 


U 


23.611 


rl LU IM La^i UUO^J- / y u 


n 

u 


100 


HLUNG20055240 


0 


100 


HLUNG20056560 


0 


75.961 


HLUNG20057380 


0 


100 


HLUNG20059240 


0 


100 


HLUNG20060670 


0 


100 


HLUNG20063700 


0 


100 


HLUNG20065700 


0 


62.8 


HLUNG20065990 


0 


100 


HLUNG20067810 


0 


100 


HLUNG20068120 


0 


50.947 


HLUNG20069350 


0 


100 


HLUNG20070410 


0 


100 


HLUNG20072100 


0 


54.241 
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w 



15 



20 



25 



30 



35 



40 



Table 39 


(continued) 




Clone ID 


FELNG 


HLUNG 


HLUNG20072190 


0 


79.349 


HLUNG20072450 


0 


7.744 


HLUNG20074330 


0 


100 


HLUNG20079310 


0 


100 


HLUNG20081390 


0 


66.429 


HLUNG20081530 


0 


100 


HLUNG20082350 


0 


100 


HLUNG20083330 


0 


100 


HLUNG20083480 


0 


13.123 


HLUNG20083840 


0 


100 


HLUNG20083960 


0 


40.76 


HLUNG20084790 


o 


100 


HLUNG2008521 0 


o 


50.993 


HLUNG20088750 


o 


100 


HLUNG20092530 


o 


100 


HLUNG20093030 


o 


100 


HLUNG20094130 


o 


75.987 


KIDNE201 42900 


o 


68.268 


PROST2001 6760 


o 


8.589 


PROST20052850 


o 


57.701 


SKNMC20006350 


o 


2.134 


SMINT20035050 


0 


6.135 


SPLEN20012450 


0 


25.695 


TESTI20057590 


0 


17.804 


TESTI20061200 


0 


29.123 


TESTI20067480 


0 


18.856 


TESTI201 16050 


0 


30.168 


THYMU1 0004280 


0 


7.603 


THYMU20010180 


0 


79.349 


THYMU20139160 


0 


1.939 


TRACH20011010 


0 


22.907 


UTERU20016580 


0 


43.64 


UTERU20127030 


0 


66.318 



Table 40 



45 



50 



55 



Alteration of the expression level of each clone due to TNF-a stimulation to human monocyte cell line THP-1 
and alteration of the expression level of each clone due to co-culture of gastric cancer cell line MKN45 with Helico- 
bacter pylori ctl, TNF_1 h, and TNF_3h in the column of THP-1 , respectively, indicate the relative mRNA expression 
levels in unstimulated THP-1 , in the cell stimulated with 10 ng/mL TNF-a for 1 hour, and in the cell stimulated with 
10 ng/mL TNF-a for 3 hours; ctl, Hp, and AcagE in the column of MKN45 indicate the relative mRNA expression 
levels in MKN45 cultured without Helicobacter pylori, in the cells co-cultured with cag PAI-positive Helicobacter 
pylori (TN2) (at a ratio of MKN45 : TN2 = 1 :100 cells (colonies)) for 3 hours, and in the cells co-cultured with the 
cagE mutant (TN2AcagE) (at a ratio of MKN45 : TN2AcagE = 1:100 cells (colonies)) for 3 hours, respectively. 
[ATAC-PCR] 


Clone name 


THP-1 


MKN45 




ctl 


TNF_1 h 


TNF_3h 


ctl 


Hp 


AcagE 


ASTR020045840 
ASTR020055930 


1.5 
0.8 


2.3 
1.9 


1.9 
1.4 


2.3 
0.8 


2.4 
0.8 


0.2 
0.5 
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Table 40 (continued) 



5 



10 



55 



Clone name 


THP-1 


MKN45 


ctl 


TNF_1h 


TNF_3h 


ctl 


Hp 


AcagE 


ASTR020088950 


1.0 


0.2 


2.5 


1.1 


0.3 


0.3 


BNGH420052350 


2.2 


2.2 


0.0 


0.5 


0.5 


3.2 


BRACE20052530 


2.6 


1.0 


0.3 


2.2 


1.0 


0.9 


BRACE20054080 








0.8 


1.1 


1.0 


BRAMY20003880 


1.5 


0.9 


0.6 


1.2 


0.0 


1.3 


BRAMY20027390 


0.6 


4.2 


0.1 


2.9 


0.4 


0.1 


B RAM Y20028530 








0.5 


3.4 


4.1 


BRAMY20035380 


1.3 


0.9 


0.8 


0.5 


1.7 


0.5 


BRAMY20036530 








1.1 


0.3 


0.3 


BRAMY20050940 


0.4 


0.0 


0.0 


0.2 


0.2 


1.0 


B RAM Y20072440 








0.5 


0.5 


0.1 


BRAMY20096930 


3.8 


5.2 


4.7 


1.7 


2.4 


1.7 


BRAMY201 1 841 0 


0.8 


3.7 


0.8 


2.7 


0.0 


0.0 


BRAMY202371 90 


0.0 


2.3 


0.1 








BRAWH20055330 


2.5 


4.7 


2.8 


1.2 


1.3 


0.0 


BRAWH20078620 








1.9 


1.2 


1.0 


BRAWH201 90530 


0.6 


0.1 


0.0 


0.3 


0.0 


0.0 


BRCAN20001 480 


1.0 


3.4 


4.1 


1.4 


1.6 


0.6 


RRHIP1 0000720 


0.3 


1.8 


1.4 


0.9 


0.0 


1.7 


RRHIP1 0001 040 

L-J 111 111 1 UUU 1 V^rW 


0.9 


0.7 


0.1 


0.0 


0.1 


0.0 


RRHIP2000021 0 


0.6 


0.6 


0.0 


2.0 


0.3 


0.0 


BRSSN20001 970 


0.8 


1.4 


1.3 


0.8 


0.7 


0.5 


BRSSN20091 1 90 








0.6 


0.1 


0.8 


CD34C20001 750 


0.0 


0.4 


2.0 








CTONG20078340 


0.3 


2.6 


1.6 


0.9 


0.7 


2.4 


CTONG20079590 


1.0 


1.2 


0.2 


0.1 


0.0 


0.0 


CTONG20083980 


0.0 


0.0 


0.0 


1.3 


1.8 


1.9 


CTONG2008521 0 


0.8 


1.2 


2.3 


0.1 


0.2 


0.1 


DFNFS200634R0 


1.7 


3.6 


2.7 


1.3 


2.0 


0.1 


nFNF < ^20072QQ0 


1.4 


1.9 


2.0 


5.0 


4.9 


4.2 




1.8 


5.5 


3.8 


2.3 


2.2 


2.3 




0.1 


0.1 


0.0 


1.7 


0.5 


0.5 


FHRRFPOO^fi^fiO. 


0.6 


0.7 


0.4 


0.4 


0.2 


0.2 


FCRRF30022fiR0 


2.9 


1.0 


0.3 


2.9 


1.0 


0.3 


fp RRF^nnTRRnn 

roDDrouu/ ouuu 


1.1 


2.6 


0.6 








FCBBF301 05080 


1.8 


1.6 


1.9 


0.2 


0.1 


0.0 


FCBBF301 69870 


1.1 


1.2 


0.2 


1.6 


0.3 


0.3 


FCBBF30225930 


2.2 


0.8 


1.0 


1.1 


0.7 


0.2 


FCBBF50000610 


2.1 


2.8 


2.2 


2.0 


2.2 


1.3 


FEBRA20007820 


0.0 


1.7 


2.4 


2.1 


1.4 


1.2 


FEBRA20031280 


0.1 


1.8 


4.5 


0.5 


0.0 


0.0 


FEBRA20031810 


1.4 


3.9 


3.5 


1.5 


2.1 


1.9 


FEBRA20039260 


2.0 


3.0 


2.5 








FEBRA20046280 


1.3 


0.3 


0.3 








FEBRA20084750 








2.5 


2.2 


0.3 


FEBRA201 82030 


3.0 


4.0 


4.2 


1.6 


0.3 


0.7 


HLUNG20041540 


0.0 


2.2 


2.2 


1.9 


2.4 


0.2 


HLUNG20092530 


0.3 


0.3 


3.1 


0.2 


0.2 


0.9 



174 



EP 1 308 459 A2 



Table 40 (continued) 



5 



10 



40 



Clone name 


THP-1 


MKN45 


ctl 


TNF_1h 


TNF_3h 


ctl 


Hp 


AcagE 


KIDNE20084030 


1.6 


0.1 


0.3 


0.1 


0.0 


0.0 


KIDNE20084800 


0.6 


0.3 


0.0 


0.5 


0.5 


1.1 


KIDNE20134130 


0.4 


0.4 


0.5 


2.3 


1.2 


1.6 


KIDNE201 82540 








1.1 


0.3 


0.3 


KIDNF20186170 


0.0 


0.0 


0.0 


0.6 


0.0 


0.0 


KIDNF201 88630 


1.5 


0.5 


0.5 


0.4 


0.3 


0.0 


1 IVFR20007750 


1.3 


1.8 


0.3 


1.9 


0.7 


0.0 


M FSAN20021 220 


1.7 


2.6 


1.6 


2.3 


0.9 


0.3 


MF C 5AN200841 ^0 


0.8 


2.6 


2.2 


1.7 


2.1 


1.1 


MT2NF2005Q21 0 








1.4 


0.4 


0.1 


MT2N F200R2 1 30 

IN 1 ^IMC&UUO^ 1 OU 


1.8 


1.3 


0.5 


1.9 


0.3 


0.3 


MT2M F2nnQS>Q^n 


1.3 


2.7 


3.4 


1.7 


2.4 


1.8 


MT2RP7fin^1 070 
IN Itnr/u UO I U I U 


0.3 


0.9 


1.4 


0.4 


0.4 


0.0 




0.3 


0.3 


1.3 


0.9 


0.2 


1.2 


OPRRPom 1 ns>i n 


4.5 


1.8 


0.5 


0.5 


0.5 


4.5 


UuDDr^U I IU/OU 


0.4 


0.5 


0.3 


0.1 1 


0.0 


0.0 


UvDDrtU I OOUOU 


1.1 


1.1 


0.9 


2.5 


0.8 


0.3 


OPRRF901 RRQOn 
UODDrtU I 


1.7 


4.4 


4.4 


1.9 


3.3 


1.1 


nrRRP9ni 7rv3Rn 

UUDDi ^U I / UOOU 


0.9 


5.4 


0.3 


0.3 


1.7 


1.5 


nrRRF?m 7rr^o 


0.5 


1.3 


0.5 


0.7 


0.7 


0.0 


pi ArFRnnnR^nn 


0.8 


3.2 


1.0 


0.5 


0.5 


0.9 


pi APFRnnfii r \ic\ 

rLnbCDUUD IO/U 


2.7 


0.8 


1.5 


1.2 


1.0 


1.3 


PRo^Tonm 1 1 fin 


0.2 


1.7 


2.0 


1.3 


5.5 


1.7 


p RO^T^no/ii zifin 


2.9 


0.1 


1.5 


0.6 


0.0 


0.0 


PROQT^nnfitii on 
i nuo i ^uuoj i uu 


1.4 


1.9 


1.6 


5.3 


0.0 


0.0 




1.7 


0.5 


0.5 


2.6 


0.5 


0.5 


PROQT^ni nfinfin 

r nuo 1 ^U 1 UDUDU 


0.2 


0.2 


2.0 


1.0 


0.6 


0.7 


PROQT9H1 1 m ori 

l nUo 1 ^U 1 IU 1 ^U 


1.2 


0.6 


0.5 








orxivivjo^uuc? i y uu 


1.2 


1.6 


0.3 


1.2 


2.6 


0.3 


O IVI I IN I tUUt't I *+U 








0.0 


0.1 


0.0 


q m I MT9nnQ9 1 fin 
o ivi UN i *iuuy<i i ou 


0.5 


0.7 


0.3 


1.3 


1.2 


0.3 


o rLt rN^UU^t-U / OU 


1.0 


2.8 


1.9 


0.3 


0.8 


0.1 


Or^LtlN^U I I UOOU 


2.4 


0.1 


7.6 


1.4 


0.1 


0.1 


CDI CMOft177/lf>n 


0.8 


3.3 


1.3 


1.3 


0.7 


0.3 


1 to 1 l < cuUoo^'HJ 








0.1 


0.0 


0.0 


TESTI20043130 


0.0 


0.1 


0.7 








TESTI20046540 


1.1 


0.8 


0.2 


1.1 


0.8 


0.3 


TESTI20047370 


0.4 


0.4 


0.5 


0.6 


0.0 


0.5 


TESTI20057200 


2.5 


0.0 


1.2 


1.1 


0.4 


0.3 


TESTI20057590 


0.1 


0.1 


0.0 


3.8 


3.5 


2.2 


TESTI201 13940 


5.2 


0.2 


0.2 


4.4 


0.4 


0.4 


TESTI201 49880 








2.2 


0.2 


2.2 


TESTI20151800 


2.1 


3.3 


2.3 


2.5 


1.0 


0.3 


TESTI201 73050 


0.8 


0.6 


0.5 


1.8 


1.1 


1.0 


TESTI201 98600 








2.2 


0.2 


2.2 


TESTI20257910 


1.2 


0.3 


0.3 


1.2 


0.2 


0.7 


TESTI20262940 


1.5 


1.1 


0.2 


1.1 


1.3 


0.2 


THYMU20046770 








1.7 


0.5 


0.5 
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Table 40 (continued) 



Clone name 


THP-1 


MKN45 


oil 


i in r i n 


1 IN i ON 


rtl 
on 


Mp 


AcagE 


THYMU20058550 








1 .9 


0.1 


0.1 


THYMU20062520 


0.0 


0.0 


0.7 


0.2 


0.3 


0.1 


THYMU20062770 








1 .6 


1 .0 


0.3 


THYMU20078240 


0.3 


1 .7 


2.9 


0.0 


1 .0 


0.1 


THYMU20150190 


0.2 


0.2 


0.6 


1 A 


0.5 


1 .9 


TRACH201 25620 


1 .1 


2.5 


1 .5 


1 .4 


1 .4 


1 .0 


TRACH20 149740 


9.4 


9.4 


0.9 


1 .6 


2.1 


0.6 


TRACH201 90460 


2.0 


3.3 


3.1 


0.2 


1 .1 


0.2 


UTERU20045200 


1 .4 


2.6 


3.9 


0.9 


2.1 


2.5 


UTERU20064120 


0.6 


2.7 


2.0 


0.6 


1 .7 


0.4 


UTERU201 03200 


0.1 


0.0 


0.9 


2.4 


1.7 


0.9 


ADRGL20046760 








0.5 


0.5 


0.5 


ASTR020055530 


0.7 


2.4 


1.7 


0.8 


2.1 


1 .6 


BRAMY20076130 


0.1 


1.4 


0.2 


0.1 


0.0 


0.0 


CTONG201 70940 


1 .6 


0.8 


0.3 


0.5 


9.9 


0.5 


FCBBF20033360 


0.1 


0.4 


1.4 


6.4 


0.3 


0.7 


FCBBF30257370 


2.7 


0.3 


0.3 


2.2 


0.1 


2.2 


FCBBF50001650 


1 .0 


1 .6 


1 .2 


1 .3 


1.0 


0.9 


FEBRA20040290 


0.4 


1.9 


1.1 


0.4 


1.5 


1.4 


FEBRA20063720 


2.9 


3.3 


3.1 


1 .5 


0.7 


1 .4 


FEBRA20098040 


2.5 


2.1 


1.5 


4.2 


0.0 


0.3 


FEBRA201 08580 


1.4 


2.8 


2.8 


0.7 


0.2 


0.2 


MESAN20021860 


0.1 


1.2 


0.2 


0.9 


1 .0 


0.9 


MESAN20067430 


0.8 


3.0 


1.5 


0.0 


0.3 


0.0 


NT2NE20045190 


0.7 


0.2 


0.2 


0.8 


0.6 


0.2 


PROST200 16760 








1.1 


3.4 


2.3 


SKNSH20007160 


0.6 


0.3 


0.2 


1 1 


0.7 


0.6 


SMINT20006020 


2.5 


1.0 


0.3 


2.4 


0.3 


0.3 


TESTI20059370 


0.0 


0.2 


0.3 


0.0 


0.0 


0.0 


TESTI20 103690 


0.0 


0.0 


0.0 


0.8 


0.3 


0.3 


TESTI20254480 


0.0 


0.1 


0.6 


0.8 


0.6 


0.0 


THYMU 10004280 


0.2 


2.2 


0.2 


0.2 


1.1 


1.4 


THYMU20030460 


0.2 


0.6 


0.2 


0.6 


0.8 


2.4 


TRACH20090060 


0.5 


0.3 


2.0 


0.0 


6.8 


1.7 


UTERU20041970 


1.6 


1.1 


1.5 


0.0 


3.5 


2.4 


BRAMY201 25360 


0.8 


1.6 


1.1 


0.0 


0.0 


0.0 


OCBBF201 42290 


0.2 


0.9 


0.3 


0.1 


0.4 


0.0 


SKMUS20006790 


0.7 


0.5 


0.3 


1.0 


0.8 


0.7 


TESTI20030610 


0.1 


2.0 


1.7 


0.0 


0.3 


0.0 


UTERU20026620 


0.4 


4.2 


4.2 


2.4 


2.7 


0.9 



Homology Search Result Data 



[031 3] Data obtained by the homology search for full-length nucleotide sequences and deduced amino acid sequenc- 
es. 

[0314] In the result of the search shown below, both units, aa and bp, are used as length units for the sequences to 
be compared. 

[031 5] Each data includes Clone name, Definition in hit data, P value, Length of sequence to be compared, Homology, 
and Accession number (No.) of hit data. These items are shown in this order and separated by a double-slash mark, //. 
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ADRGL20020290//Human placental equilibrative nucleoside transporter 1 (hENT1) mRNA, complete cds Jl\ 70E-240II 
456aa//1 00%//U8 1 375 

ADRGL20021910//Homo sapiens transmembrane protein B7-H2 ICOS ligand mRNA, complete cds 7/2. 50 E-8 8// 
1 68aa//1 00%//AAG01 1 76 

5 ADRGL20022600//DIAPHANOUS PROTEIN HOMOLOG 1 (P140MDIA).//2.00E-07//121aa//36%//008808 
ADRGL20023920//ABC1 PROTEIN HOMOLOG P RECU RSOR.//1 .40E-98//467aa//45%//Q92338 
ADRGL20026790//Homo sapiens PLIC-1 mRNA, complete cds.//6.10E-05//169aa//31%//AF293384 
ADRGL20027530 
ADRGL20036380 

10 ADRGL20036840//HLA CLASS I HISTOCOMPATIBILITY ANTIGEN, ALPHA CHAIN H 

PRECURSOR (HLA-AR) (HLA-12.4).//7.30E-68//131aa//96%//P01893 

ADRGL20040310 

ADRGL20040770 

ADRGL20046760 
15 ADRGL20047080 

ADRGL20047770 

ADRGL20057560 

ADRGL20059610//GLUCOSYLCERAMIDASE PRECURSOR (EC 3.2.1 .45) (BETA-GLUCOCEREBROSIDASE) (AC- 
ID BETA-GLUCOSIDASE) (D-GLUCOSYL-N-ACYLSPHINGOSINE GLUCOHYDROLASE) (ALGLUCERASE) (IMI- 
20 GLUCERASE).//3.00E-94//1 88aa//93%//P04062 

ADRGL20062330//Homo sapiens trabeculin-alpha mRNA, complete cds.//1 .10E-128//439aa//58%//AF141 968 
ADRGL20063770 

ADRGL20066770//elastin microfibril interface located protein [Homo sapiens].//3.00E-31//210aa//44%//NP_008977 
ADRGL20067320 

25 ADRGL20079060//Mus musculus mRNA for Ky protein (muscle-specific protein) 7/3.80 E-266//281 aa//84%//AJ293727 
ADRGL20095330 

ASTRO20001910//Rattus norvegicus mRNA for annexin V-binding protein (ABP-10), partial cds,//2.20E-57//153aa// 
73%//D 64062 
ASTRO20003720 
30 ASTRO20004820 

ASTRO20006530//Homo sapiens hookl protein (HOOK1) mRNA, complete cds7/1 .80E-94//383aa//55%//AF044923 
ASTRO20009140//PUTATIVE COMPETENCE-DAMAGE PROTEIN.//2.70E-06//1 67aa//29%//P46323 
ASTRO20010010 
ASTRO200 10290 
35 ASTRO200 12270 
ASTRO20020240 
ASTRO20020350 
ASTRO20022020 

ASTRO20026320//NAM7 PROTEIN (NONSENSE-MEDIATED MRNA DECAY PROTEIN 1) (UP-FRAMESHIFT SUP- 
40 PRESSOR 1)7/1 .80E-47//432aa//33%//P30771 
ASTRO20027330 

ASTRO20038400//Homo sapiens zinc finger homeobox protein 2HX1 mRNA, complete cds.//3.40E-78//282aa//42%// 

AF1 06862 

ASTRO20045840 

45 ASTRO20046280//PSU1 PROTEIN.//1 .30E-42//228aa//36%//P53550 
ASTRO20047510 

ASTRO2005081 0//L-RIBULO KINASE (EC 2.7.1 .1 6)7/1 .1 0E-43//51 2a a//30%//P 94524 

ASTRO20052420//PROBABLE GUANINE NUCLEOTIDE REGULATORY PROTEIN TIM (ONCOGENE TIM) (P60 
TIM) (TRANSFORMING IMMORTALIZED MAMMARY ONCOGEN E).//4.80E-1 51 //408aa//76%//Q1 2774 
50 ASTRO20053430//BAND 4.1 -LIKE PROTEIN 4 (NBL4 PROTEIN).//1 .80E-58//307aa//38%//057457 
ASTRO20055530 

ASTRO20055570//MAJOR PRION PROTEIN PRECURSOR (PRP) (PRP27-30) (PRP33-35C) (ASCR).//5.40E-72// 

137aa//100%//P04156 

ASTRO20055930 

55 ASTRO20058960//DNA damage inducible protein homolog - fission yeast (Schizosaccharomyces pombe)//1 .90E-1 4// 
205aa//31 %//T39541 
ASTRO20069200 

ASTRO20075150//TNF RECEPTOR ASSOCIATED FACTOR 3 (CD40 RECEPTOR ASSOCIATED FACTOR 1) 
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(CRAF1) (CD40 BINDING PROTEIN) (CD40BP) (LMP1 ASSOCIATED PROTEIN) (LAP1).//1 .OOE-25//60aa//98%// 
Q13114 

ASTRO20076660 

ASTRO20085080//TIPD PROTEIN.//1 .80E-58//307aa//37%//015736 

5 ASTRO20088950//LACTASE-PHLORIZIN HYDROLASE PRECURSOR (LACTASE-GLYCOSYLCERAMIDASE) [IN- 
CLUDES: LACTASE (EC 3.2.1.108); PHLORIZIN HYDROLASE (EC 3.2.1 .62)] .//7.80E-85//331aa//48%//P09848 
ASTRO20089600//Mus musculus sacsln gene, complete cds.//1 .10E-05//198aa//26%//AF1 93557 
ASTRO20090680//M.musculus mRNA for IB3/5-polypeptide.//1 .60E-173//412aa//78%//X79131 
ASTRO20091180 

10 ASTRO20091770 
ASTRO20141740 

BGGI120000670//Rattus norvegicus myosin heavy chain Myr 8b mRNA, complete cds.//1 .660E-05//86aa//36%// 
AY004215 

BGGI120010750//Rattus norvegicus mRNA for SECIS binding protein 2 (sbp2 gene).//7.90E-305//812aa//71%// 
15 AJ251245 

BNGH41 0000570 

BNGH420008150//Human SH3 domain-containing proline-rich kinase (sprk) mRNA, complete cds.//3.40E-139// 

326aa//82%//U07747 

BNGH420014060 

20 BNGH420015760//Mus musculus mRNA for JNK-binding protein JNKBP1 , complete cds.//1 ,60E-130//381aa//60%// 
AB029482 
BNGH420021680 

BNGH420023870//RIBONUCLEASE INHIBIT0R.//4.70E-41//314aa//34%//P 10775 
BNGH420024870 

25 BNGH420035290//MYOSIN HEAVY CHAIN KINASE B (EC 2.7.1 .129) (MHCK B).//7.60E-37//273aa//35%//P90648 
BNGH420036410 
BNGH420040760 
BNGH420042910 
BNGH420045380 

30 BNGH420046790//immunoglobulin lambda light chain variable region [Homo Sapiens].//5.00E-47//84aa//100%// 
AAG24674 
BNGH420052350 

BNGH420059680//DIPZ PROTEIN.//3.00E-13//1 66aa//31%//Q1 0801 
BNGH420061350 
35 BNGH420062340 

BNGH420070370//ZINC FINGER PROTEIN GLI1 (GLI).//3.20E-65//165aa//59%//P47806 

BNGH420074600//DNA-DIRECTED RNA POLYMERASE III 128 KDA POLYPEPTIDE (EC 2.7.7.6) (RNA POLYMER- 
ASE ill SUBUNIT 2).//5.40E-214//522aa//72%//P25167 
BNGH420075940 

^0 BNGH420077980//Rattus norvegicus ankyrin binding cell adhesion molecule neurofascin mRNA, alternatively spliced 
form, partial cds.//0//588aa//98%//U81 036 
BNGH420085100 

BNGH420086030//N-CHIMAERIN (NC) (N-CHIMERIN) (ALPHA CHIMERIN) (A-CHIMAERIN).//2.30E-21//130aa// 
40%//P15882 

45 BNGH420087430//MUS musculus mRNA 1 for phtf protein.//5.80E-118//237aa//57%//AJ 133721 

BRACE1 000051 0//KERATIN, ULTRA HIGH-SULFUR MATRIX PROTEIN (UHS KERATIN).//4.70E-12//132aa//37%// 
P26371 

BRACE20003310//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1).//1 .00E-205//504aa//71 %// 
Q03923 

50 BRACE20007330//RING CANAL PROTEIN (KELCH PROTEIN).//4.60E-66//562aa//31%//Q04652 
BRACE20009050 

BRACE20014450//Mus musculus mRNA for Ndr1 related protein Ndr2, complete cds.//3.20E-150//291aa//95%// 
AB033921 
BRACE20017790 
55 BRACE20018810 
BRACE20025820 
BRACE20038920 

BRACE20050870//PRE-MRNA SPLICING HELICASE BRR2 (EC 3.6.1 .-).//8.40E-59//584aa//31%//P32639 
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BRACE20051600 

BRACE20051930//NEUROPILIN PRECURSOR (A5 PROTEIN) (A5 ANTIGEN) 7/5. 50 E-20//1 79aa//30%//P28824 
BRACE20052430//Homo sapiens AMSH mRNA, complete cds.//4. 30 E-75//272aa//53%//U 73522 
BRACE20052530 
5 BRACE20054080 
BRACE20054480 

BRACE20054600//Xenopus laevis mRNA for Kielin, complete cds.//4.30E-70//205aa//60%//AB026192 
BRACE20055560 
BRACE20057870 
10 BRACE20059110 
BRACE20059810 

BRACE20061 620//ZINC-BINDING PROTEIN A33.//3.60E-30//329aa//28%//Q02084 
BRACE20062580 

BRACE20063540//MEROZOITE SURFACE PROTEIN CMZ-8 (FRAGMENT) .60E-1 0//1 64aa//35%//P091 25 
15 BRACE20065470//Xenopus laevis ubiquitin-like fusion protein mRNA, complete cds.//6.70E-63//1 70aa//71 %//L08474 
BRACE20066360 
BRACE20068710 

BRACE20069000//CLN3 PROTEIN (BATTENIN) (BATTEN'S DISEASE PROTEIN).//1 .20E-147//279aa//100%// 
Q13286 
20 BRACE20069110 
BRACE20069440 

BRACE20079200//Xenopus laevis mRNA for Kielin, complete cds.//3.10E-15//63aa//58%//AB026192 
BRACE20079370//microtubule associated-protein orbit [Drosophila melanogaster].//8.00E-42//282aa//36%// 
BAA94248 

25 BRACE20097540//Homo sapiens protein serine/threonine phosphatase 4 regulatory subunit 1 (PP4R1) mRNA, com- 
plete cds.//2.80E-96//1 93aa//96%//AF1 1 1 1 06 
BRACE20098860 
BRACE20099070 

BRACE201 94670//UDP-GALACTOSE TRANSLOCATOR (UDP-GALACTOSE TRANSPORTER) (UGT) (UDP-GAL- 
30 TR).//1 .40E-32//72aa//98%//P78381 

BRACE20196180//Homo sapiens HMG domain protein HMGX2 (HMGX2) mRNA, complete cds.//6.90E-154//235aa// 
91 %//AF 146223 
BRACE201 96960 

BRACE20200770//PROTEIN MOV-1 0.//3.30E-24//113aa//50%//P23249 
35 BRACE20200970 

BRACE20204670//PROTEIN-TYROSINE PHOSPHATASE ALPHA PRECURSOR (EC 3.1.3.48) (R-PTP- ALPHA).// 
4.30E-237//428aa//99%//P1 8433 
BRACE20205840 
BRACE20207420 
40 BRACE20212450 

BRACE20215410//PROTEIN-TYROSINE PHOSPHATASE YVH1 (EC 3.1.3.48) (PTPASE 

YVH1).//2.90E-08//136aa//31%//Q02256 

BRACE20216700 

BRACE20216950//4F2 CELL-SURFACE ANTIGEN HEAVY CHAIN (4F2HC) (LYMPHOCYTE ACTIVATION ANTIGEN 
45 4F2 LARGE SUBUNIT) (4F2 HEAVY CHAIN ANTIGEN) (CD98 ANTIGEN).//4.80E-66//94aa//90%//P08195 

BRACE20219360 

BRAMY1 0000980 

BRAMY1 0001 730 

BRAMY20000210 
50 BRAMY20000250 

BRAMY20001510//Homo sapiens RING zinc finger protein (RZF) mRNA, complete cds.//3.80E-131//245aa//99%// 
AF037204 

BRAMY20003540//1-PHOSPHATIDYLINOSITOL-4,5-BISPHOSPHATE PHOSPHODIESTERASE DELTA 1 (EC 
3.1.4.11) (PLC-DELTA-1) (PHOSPHOLiPASE C-DELTA-1) (PLC-lll).//0//745aa//99%//P51178 
55 BRAMY20003880 

BRAMY20005080//UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 16 (EC 3.1.2.15) (UBIQU1TIN THIOLESTE- 
RASE 16) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 16) (DEUBIQUITINATING ENZYME 16) (UBIQUITIN 
PROCESSING PROTEASE UBP-M).//2.70E-46//93aa//100%//Q9Y5T5 
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BRAMY2001 3670//PECANEX PROTEIN.//1 ,80E-84//300aa//56%//P1 8490 
BRAMY20016780 
BRAMY20020440 
BRAMY20021580 
5 BRAMY20023390 
BRAMY20023640 
BRAMY20024790 
BRAMY20027390 

BRAMY20027990//Homo sapiens NEDL1 mRNAfor NEDD4-like ubiquitin ligase 1 , complete cds .//4. 60 E-158//294a a// 
10 100%//AB048365 
BRAMY20028530 

BRAMY20028620//N ICOTI N ATE-NUCLEOTID E PYROPHOSPHORYLASE [CARBOXYLATING] (EC 2.4.2.19) (QUI- 
NOLINATE PHOSPHORIBOSYLTRANSFERASE [DECARBOXYLATING]) (QAPRTASE).//9.70E-1 8//53 a ay/84%// 
Q 15274 

15 BRAMY20035380//ZINC TRANSPORTER 1 (ZNT-1 ) 7/5.60 E-40//21 1 aa//35%//Q62720 

BRAMY20035830//Homo sapiens RCC1-like G exchanging factor RLG mRNA, complete cds.//3.60E-103//251aa// 

75%//AF060219 

BRAMY20036530 

BRAMY20036810 

20 BRAMY20038980//1 INTRACELLULAR PROTEIN TRANSPORT PROTEIN US01 7/3.40E-1 7//407aa//23%//P25386 
BRAMY20039290 

BRAMY20040580//ZINC FINGER PROTEIN 1357/1 .70 E-29//1 15 aa//54%//P52 742 
BRAMY20043520 

BRAMY20043630//Homo sapiens Ras-binding protein SUR-8 mRNA, complete cdsV/1 .30E-1 67//364aa//88%// 
25 AF068920 

BRAMY20044920//UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 4 (EC 3.1.2.15) (USIQUITIN THIOLESTE- 
RASE 4) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 4) (DEUBIQUITINATING ENZYME 4) (UBIQUITOUS 
NUCLEAR PROTEIN HOMOLOG.//7.60E-28//86aa//4796//Q131 07 
BRAMY20045210 
30 BRAMY20045420 
BRAMY20047560 
BRAMY20050640 
BRAMY20050940 

BRAMY20051820//Human mRNA for Doc2 (Double C2), complete cds.//1 .90E-49//102aa//99%//D31897 
35 BRAMY20052440 
BRAMY20053910 

BRAMY20055760//POTENTIAL PHOSPHOLIPID-TRANSPORTING AT PAS E VA (EC 3.6.1 .-).//5. 80 E-130//393aa// 
59%//054827 

BRAMY20056620//Homo sapiens mccb mRNA for non-biotin containing subunit of 3-methylcrotonyl-CoA carboxylase, 
40 complete cds7/3.00E-1 06//203aa//1 00%//AB050049 

B R AM Y20056840//U B E- 1 C2//2 .40 E-74//26 1 aa//53%//AB030505 

BRAMY20063750//Homo sapiens HRIHFB2007 mRNA, partial cds.//3.40E-139//253aa//99%//AB015330 
BRAMY20072440 

BRAMY20072870//ACETYL-COENZYME A SYNTHETASE (EC 6.2.1.1) (ACETATE— COA LIGASE) (ACYL- ACTI- 
45 VATING ENZYME).//2.50E-15//88aa//46%//P16929 
BRAMY20073080 
BRAMY20074110 
BRAMY20074860 

BRAMY20076100//STEROIDOGENIC FACTOR 1 (STF-1) (SF-1) (STEROID HORMONE RECEPTOR AD4BP) 
50 (FUSHI TARAZU FACTOR HOMOLOG 1 ).//4.80E-66//1 32aa//94%//P50569 
BRAMY20076130 
BRAMY20076530 

B RAM Y20083330//S YN APSIN I.//4.50E-05//1 55aa//29%//P 1 7599 
BRAMY20083820 

55 BRAMY20089770//P2X PURINOCEPTOR 7 (ATP RECEPTOR) (P2X7) (PURINERGIC RECEPTOR) (P2Z RECEP- 
TOR).//3.30E-1 36//242aa//99%//Q99572 

BRAMY20091230//MITOCHONDRIAL UNCOUPLING PROTEIN 4 (UCP 4).//4.60E-121//224aa//l00%//095847 
BRAMY20093490//Mus musculus ubiquitin-protein ligase E3-alpha (Ubr1) mRNA, complete cds.//7.80E-43//139aa// 
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55%//AF061555 

BRAMY20094890//A KINASE ANCHOR PROTEIN 4 PRECURSOR (MAJOR FIBROUS SHEATH PROTEIN) (FSC1) 
(P82).//4.60E-06//131aa//27%//Q60662 
BRAMY20095080 
5 BRAMY20095570 

BRAMY20096930//Tbrpedo marmorata mRNA for male sterility protein 2-like protein (ms21 gene)7/2.00E-63//139aa// 

82%//AJ272073 

BRAMY201 00680 

BRAMY20102900//Homo sapiens RU1 (RU1) mRNA, complete cds.//1 .20E-47//151aa//58%//AF168132 
10 BRAMY201 07980 

BRAMY20111780//ZINC FINGER PROTEIN 135.//1 .00E-139//416aa//57%//P52742 

BRAMY20117670//Mus musculus mmDNAJA4 mRNA for mmDj4, complete cds.//3.20E-11 8//239aa//90%//AB032401 
BRAMY201 18410 

BRAMY201 1 8490//GLYCEROL KINASE 2 (EC 2.7.1.30) (ATP:GLYCEROL 3-PHOSPHOTRANSFERASE 2) (GLYC- 
15 EROKINASE 2) (GK 2).//1 .80E-48//247aa//40%//Q9X1 E4 
BRAMY20120170 
B R AM Y20 123400 
B RAM Y20 124970 
BRAMY20125170 

20 BRAMY201 25360//L-ASPARAGINASE (EC 3.5.1.1) (L-ASPARAGINE AMIDOHYDROLASE).//3.30E-53//148aa// 
43%//Q9ZSD6 

BRAMY20125550//Homo sapiens mRNA for 28kD interferon responsive protein (IFRG28 gene).//4.40E-16//155aa// 
33%//AJ251832 
BRAMY201 26910 
25 BRAMY201 27310 
BRAMY201 27760 

BRAMY20134050//NUCLEOSOME ASSEMBLY PROTEIN 1-LIKE 2 (BRAIN-SPECIFIC PROTEIN, X- LINKED).// 
1 .40E-25//1 09aa//569d//P51 860 
BRAMY201 35720 

30 BRAMY20137360//Homo sapiens gene for TU12B1-TY, exon 12 and complete cds.//2.10E-18//257aa//31%// 
AB032786 
BRAMY201 39440 
BRAMY201 39750 

B R AMY20 1 43870//P E PTI D YL-TRN A HYDROLASE (EC 3.1.1.29) (PTH).//5.80E-27//1 82aa//35%//P96386 
35 BRAMY201 5251 0//PROTEIN-TYROSINE PHOSPHATASE STRIATUM-ENRICHED (EC 3.1.3.48) (STEP) (NEU- 
RAL-SPECIFIC PROTE1N-TYROSINE PHOSPHATASE) ( FRAGMENT) 7/4.80 E-293//537aa//97%//P54829 
BRAMY201 55500 

BRAMY201 58550//CALMODULIN.//1 .60E-1 5//1 1 6aa//42%//P04352 
BRAMY201 59250 
40 BRAMY201 60020 
BRAMY20 173480 

BRAMY20190550//EPIDERMAL GROWTH FACTOR RECEPTOR SUBSTRATE SUBSTRATE 15 (PROTEIN EPS15) 
(AF-1 P PROTEIN).//2.20E-226//464aa//89%//P42566 
B RAM Y20 194680 
45 BRAMY20204270 

BRAMY20206340//GUANINE NUCLEOTIDE-BINDING PROTEIN BETA SUBUNIT-LIKE PROTEIN.//4.20E-07// 

151aa//27%//P38011 

B RAM Y2021 9620 

BRAMY20221600//H.sapiens mRNA for novel T-cel activation protein.//1 .60E-130//245aa//99%//X94232 
50 BRAMY20223010//Mus musculus leucine-rich glioma-inactivated 1 protein precursor, (Lgi1) mRNA, complete cds.// 
2.00E-79//269aa//52%//AF24681 8 
BRAMY20225250 
BRAMY20225320 
BRAMY20227230 

55 BRAMY20227860//Homo sapiens dickkopf-3 (DKK-3) mRNA, complete cds.//2.30E-69//147aa//92%//AF 177396 
BRAMY20227960 

BRAMY20231 1 50//PUTATIVE ACID PHOSPHATASE F26C11.1 (EC 3.1 .3. 2)7/2.30 E-55//322aa//39%//Q 09 549 
BRAMY20234820//Homo sapiens mitotic checkpoint protein (MAD1) mRNA, complete cds.//1 .30E-286//561aa// 
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100%//AF123318 
BRAMY20237190 

BRAMY20238630/>TETRATRICOPEPTIDE REPEAT PROTEIN 47/1 20E-147//276aa//99%//095801 
BRAMY20243120 

5 BRAMY20244490//A DENY LATE KINASE ISOENZYME 1 (EC 2.7.4.3) (ATP-AMP TRANSPHOSPHORYLASE) (AK1) 
(MYOKINASE).//2.50E-1 9//1 1 9aa//37%//P00571 

BRAMY20245140//Rattus norvegicus potassium channel (erg2) mRNA, complete cds7/1.00E-178//427aa//81%// 

AF016192 

BRAMY20245350 

10 BRAMY20245760//Araneus diadematus fibroin-4 mRNA. partial cds.//7.90E-05//285aa//22%//U47856 

BRAMY20251210//EPHRIN TYPE-A RECEPTOR 7 PRECURSOR (EC 2. 7. 1. 112) (TYROSINE-PROTEIN KINASE 
RECEPTOR EHK-3) (EPH HOMOLOGY KINASE-3) (EMBRYONIC BRAIN KINASE) (EBK) (DEVELOPMENTAL KI- 
NASE 1) (MDK-1)7/3.80E-94//268aa//66%//Q61772 

BRAMY20251750//Homo sapiens BRI3 mRNA, complete cds.//2.80E-131//242aa//95%//AF272043 
15 BRAMY20263000//DYSTROPHIA MYOTONICA-CONTAINING WD REPEAT MOTIF PROTEIN (DMR-N9 PRO- 
TEIN).//2.60E-134//430aa//59%//Q08274 
BRAMY20267780 
BRAMY20269040 
BRAMY20271140 

20 BRAMY20274510//60S RIBOSOMAL PROTEIN L12.//1 .10E-39//102aa//82%//P30050 
BRAMY20285650 
BRAMY20287400 

BRAWH20014590//ZAKI-4 PROTEIN7/3.1 0 E- 92// 1 87aa//93%//Q 14206 
BRAWH20020470 
25 BRAWH20020600 

BRAWH20021910//FATTY ACYL-COA HYDROLASE PRECURSOR, MEDIUM CHAIN (EC 3.1.2.14) (THIOESTE- 

RASE B)7/1 .30E-111//450aa//49%//Q04791 

BRAWH20025490 

BRAWH20026010//AD021 protein [Homo sapiens]//4.00E-55//245aa//44%//NP_057697 
30 BRAWH20027250 
BRAWH20030000 

BRAWH20039640//SLIT PROTEIN PRECURSOR7/6.10E-19//282aa//31%//P24014 

B R AWH20 040680//P UTATI V E TRANSCRIPTION ELONGATION FACTOR S-ll (TFIIS).//5.90E-06//1 79aa//29%// 
P52652 
35 BRAWH20047790 

BRAWH20050740//ZINC FINGER PROTEIN 151 (POLYOMAVIRUS LATE INITIATOR PROMOTER BINDING PRO- 
TEIN) (LP-1) (ZINC FINGER PROTEIN Z1 3)7/1 .60 E-1 6//235aa//30%//Q 60821 
BRAWH20055240 
BRAWH20055330 
40 BRAWH20055780 
BRAWH20058120 

BRAWH20063010//SPLICEOSOME ASSOCIATED PROTEIN 49 (SAP 49) (SF3B53)7/2.60E-06//121 aa//33%// 
Q 15427 

BRAWH20078080 
45 BRAWH20078620 

BRAWH20080580//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2)7/1 .00E-116//316aa//63%//P51523 
BRAWH20082550 

BRAWH20082920//Human TFIIIC Box B-binding subunlt mRNA, complete cds.//1 90E-36//72aa//100%//U02619 
BRAWH20093040//PROTEIN KINASE CLK2 (EC 2.7.1 .-).//2. 70 E-86//1 62aa//96%//P49760 
50 BRAWH20093070//SYNAPSIN.//4.80E-06//245aa//28%//Q24546 

BRAWH20094900//Mus musculus mRNA for sialidase, complete cds.//5.70E-73//310aa//50%//AB026842 
BRAWH20095900//ZINC FINGER PROTEIN 184 ( FRAGM E NT) 7/1 .20E-170//631aa//48%//Q99676 
BRAWH20 173790 

BRAWH20174330//SPLICEOSOME ASSOCIATED PROTEIN 49 (SAP 49) (SF3B53)7/4.50E-06//121 aa//33%// 
55 Q 15427 

BRAWH20 175230 
BRAWH20 175340 

BRAWH201 76850//Mus musculus mRNA for nuclear protein ZAP, complete cds7/9.50E-1 51//61 9aa//53%//AB0331 68 
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BRAWH20 182670 

BRAWH20183170//GRR1 PROTEIN.//9.30E-13//218aa//28%//P24814 
BRAWH20 185260 
BRAWH20 185270 
5 BRAWH201 86010 

BRAWH20188750//BIOTIN SYNTHESIS PROTEIN BIOC.//5.80E-11//190aa//27%//P36571 

BRAWH20190530//Homo sapiens BNPI mRNA for brain-specific Na-dependent Inorganic phosphate cotransporter, 
complete cds7/2.1 0E-1 09//1 1 8aa//1 00%//AB032436 

BRAWH201 90550//PUTATIVE SERIN E/THREONINE-PROTEIN KINASE PKWA (EC 2.7.1 .-) 7/1 .30 E-05//1 72aa// 
10 29%//P49695 

BRAWH20191980//PROLINE OXIDASE, MITOCHONDRIAL PRECURSOR (EC 1.5.3.-) (PROLINE DEHYDROGE- 
NASE)./^.! 0E-1 25//234aa//99%//043272 

BRCAN1 0000760//UREA TRANSPORTER, ERYTHROCYTE.//1 .30E-212//389aa//1 00%//Q13336 
BRCAN10001050//PEANUT-LIKE PROTEIN 2 (BRAIN PROTEIN H5).//1 .40E-62//122aa//98%//043236 
15 BRCAN 10001 680 
BRCAN20001480 

BRCAN200041 80//alpha-1 C- adrenergic receptor splice form 2 - human//1 .1 0E-22//76aa//76%//S65657 
BRCAN20005230//HEPARIN SULFATE N- D E AC ETYLAS E/N-S U L FOT RAN S F E R AS E (EC 2.8.2.-) (N- HSST) 
(N-HEPARIN SULFATE SULFOTRANSFERASE) (GLUCOSAMINYL N -DEAC ETYLAS E/N-S UL FOT RAN S- 
20 FERASE).//8.90E-15//168aa//28%//P52849 

BRCAN20005410//Human 1(3)mbt protein homolog mRNA, complete cds.//2.00E-95//378aa//50%//U89358 
BRCOC1 0000400 

BRCOC20000470//Homo sapiens DEME-6 mRNA, partial cds7/7.30E-37//252aa//28%//AF007170 
BRCOC20003600//VACUOLAR ATP SYNTHASE SUBUNIT AC45 PRECURSOR (EC 3.6.1.34) (V-ATPASE AC45 
25 SUBUN IT) 7/5.90E-1 92//41 8aa//85%//P40682 
BRHIP1 0000720 

BRHIP10001040//tweety homolog 1 (Drosophila) [Mus musculus]//1 .30E-68//311aa//44%//NP_067299 
BRHIP20000210 
BRHIP20003590 
30 BRHIP20005060 
BRSSN20001970 

BRSSN2000561 0//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete 

cds.//6.30E-225//730aa//60%//AF127084 

BRSSN20005660 

35 BRSSN20066440//ZINC FINGER PROTEIN 2027/3.60E-37//1 69aa//37%//0951 25 

BRSSN20074640//HYPOTHETICAL 35.8 KDA PROTEIN IN PRP16-SRP40 INTERGENIC REG ION 7/4.50 E-20// 

217aa//28%//P36163 

BRSSN20091190 

BRSSN20092440 

40 BRSSN20093890//Homo sapiens mRNA for Kelch motif containing protein, complete cds.//8.40E-13//203aa//30%// 
AB026190 

CD34C20001750//MHC CLASS I NK CELL RECEPTOR PRECURSOR (NATURAL KILLER ASSOCIATED TRAN- 
SCRIPT 4) (NEAT-4).//1 .10E-18//214aa//3596//P43630 
CTONG1 0000090 
45 CTONG20000340 
CTONG20002790 

CTONG20004110//Mus musculus ankycorbin mRNA, complete cds.//6.20E-55//1006aa//24%//AF202315 
CTONG20004520/ldevelopment- and differentiation-enhancing factor 2; PYK2 C terminus-associated protein [Homo 
Sapiens].//2.00E-86//310aa//81%//NP_003878 
50 CTONG20007660//Rattus norvegicus caspase recruitment domain protein 9 mRNA, complete cds 7/7.30 E-28//31 9aa// 
32%//AF311288 

CTONG20008190//YPT1 -RELATED PROTEIN 27/3.00E-30//160aa//40%//P17609 

CTONG20008460 

CTONG20015240 

55 CTONG20017490//SEMAPHORIN 4A PRECURSOR (SEMAPHORIN B) (SEMA B)7/3.10E-273//607aa//82%// 
Q62178 

CTONG20020660 

CTONG20020950//ZINC FINGER PROTEIN 37 (ZFP-37) (MALE GERM CELL SPECIFIC ZINC FINGER PROTEIN).// 
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7.40E-23//258aa//25%//P1 7141 
CTONG20027660 

CTONG20029030//Homo sapiens Ras-binding protein SUR-8 mRNA, complete cds7/8.30E-25//402aa//28%// 
AF068920 

5 CTONG20030280/A/EGETATIBLE INCOMPATIBILITY PROTEIN HET-E-1 7/2.90 E-12//303aa//25%//Q 00808 
CTONG20031150 
CTONG20031890 

CTONG20032930//microtubule associated-protein orbit [Drosophila melanogaster]//1 .00E-79//91 3aa//30%// 
BAA94248 
10 CTONG20033500 

CTONG20033610//Rattus norvegicus SNIP-a mRNA, complete cds.//2.50E-145//567aa//41%//AF 156981 
CTONG20033750//Drosophila melanogaster AAA family protein Bor (bor) mRNA, complete cds.//1 .40E-174//492aa// 
66%//AF227209 
CTONG20035240 
15 CTONG20036800 

CTON G20036990//BAS E M E NT MEMBRANE-SPECIFIC HEPARAN SULFATE PROTEOGLYCAN CORE PROTEIN 

PRECURSOR (HSPG) (PERLECAN) (PLC).//1 .10E-10//247aa//27%//Q05793 

CTONG20039370 

CTONG20041150//Streptomyces ansochromogenes strain 71 00 SanE (sanE) gene, complete cds.//5.20E-05//133aa// 
20 35%//AF228524 

CTONG20041260//Mus musculus retinoic acid- responsive protein (Stra6) mRNA, complete cds.//1 .30E-238//602aa// 
74%//AF062476 

CTON G20042640//NEU ROB LAST DIFFERENTIATION ASSOCIATED PROTEIN AHNAK (DESMOYOKIN) (FRAG- 
MENTS). //0//797aa//7396//Q09666 
25 CTONG20044230//Mus musculus zinc finger protein (Mtshl) mRNA, partial cds.//1 .40E-289//601aa//89%//AF1 91309 
CTONG20044870 

CTONG20045500//ANION EXCHANGE PROTEIN 3 (CARDIAC/BRAIN BAND 3-LIKE PROTEIN) (CAE3/BAE3).// 
2.00E-1 9//276aa//30%//P48751 
CTONG20046690 
30 CTONG20049480 
CTONG20050490 

CTONG20051100//PUTATIVE METHYLTRANSFERASE (EC 2. 1 . 1. -)7/3.50E-29//72aa//87%//043709 
CTONG20051 450//testis development protein PRTD [Homo sapiens}7/9.00E-50//1 40aa//85%//AAG33852 
CTONG20052780//Homo sapiens mRNA for SH3 binding protein, complete cds 7/B. 00 E-21//125aa//42%//AB0 05047 
35 CTONG20053990//ZINC FINGER PROTEIN 1957/4.30E-08//40aa//75%//014628 
CTONG20055670 

CTONG20055850//Rattus norvegicus golgi peripheral membrane protein p65 (GRASP65) mRNA, complete cds.// 
1 .1 0E-99//248aa//78%//AF01 5264 
CTONG20056150 
40 CTONG20057750 
CTONG20057950 

CTONG20059130//Mus musculus prominin-like protein mRNA, partial cds.//7.50E-103//259aa//7796//AF128113 
CTONG20060040 
CTONG20061290 
45 CTONG20062730 

CTONG20063770//M-PHASE PHOSPHOPROTEIN 9 ( FRAGMENT) 7/1 .80E-96//1 84aa//100%//Q99550 
CTON G20063930//BETA-CH IMA ERIN (BETA-CHIMERIN)7/3.50E-31//189aa//34%//Q03070 
CTONG20065240 
CTONG20065680 

so CTONG20066110//Homo sapiens DEME-6 mRNA, partial cds7/8.70E-164//557aa//53%//AF007170 

CTONG20068360//MITOCHONDRIAL CARNITINE/ACYLCARNITINE CARRIER PROTEIN (CARNITINE/ACYLCAR- 

NITINE TRANSLOCASE) (CAC)7/8.00E-30//248aa//35%//P97521 

CTONG20069320 

CTONG20069420 
55 CTONG20070090 

CTONG20070720//N-CHIMAERIN (NC) (N-CHIMERIN) (ALPHA CHIMERIN) (A-CHIMAERIN)7/9.20E-25//180aa// 
33%//P30337 

CTONG20070780//SPERM-SPECIFIC ANTIGEN 2 (CLEAVAGE SIGNAL-1 PROTEIN) (CS-1)7/1 .90E-122//249aa// 
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97%//P28290 

CTONG2047091 0//Homo sapiens mRNA for 26S proteasome subunit p55, complete cds.//7.70E-227//400aa//1 00%// 
AB003103 

CTONG20071040//BETA CRY STALL) N B2 (BP).//6.80E-25//195aa//34%//P26775 
5 CTONG20071 680//HYPOTHETICAL 33.6 KDA PROTEIN IN TDK-PRFA INTERGENIC REGION.//7.40E-14//328aa// 
23%//P45869 

CTONG20072930//ZINC FINGER PROTEIN 41 (FRAGMENT)V/4.10E-21 6//542aa//69%//P51814 
CTONG20073990 

CTONG20074000//Mus musculus teashirt 2 (Tsh2) gene, partial cds7/0//1024aa//89%//AF207880 
W CTONG20074170 
CTONG20074740 
CTONG20076230 

CTONG20076810//site-1 protease of sterol regulatory element binding proteins [Cricetulus griseus]//2.80E-245// 
463aa//93%//AF078l05 
15 CTONG20077760//SYNAPSIN 17/1 .70E-08//209aa//32%//P1 7599 

CTONG20078340//SUPPRESSOR PROTEIN SRP40.//4.1 0E-08//282aa//26%//P32583 

CTONG20079590//ALPHA-N-ACETYLGALACTOSAMINIDE ALPHA-2.6-SIALYLTRANSFERASE (EC 2.4.99.-) 
(ST6GALNACIII) (STY).//1 .80E-151//305aa//85%//Q64686 

CTONG200801 40//HYPOTHETICAL 60.3 KDA PROTEIN R13G10.2 IN CHROMOSOME III.//8.40E-29//1 79aa//40%// 
20 Q21988 

CTONG20081840 
CTONG20083430 

CTONG20083980//VASODILATOR-STIMULATED PHOSPHOPROTEIN (VASP).//4.50E-10//113aa//34%//P50552 
CTONG20084020 

25 CTONG20084660//ZINC FINGER PROTEIN 165.//3.30E-33//142aa//57%//P4991 0 

CTONG20085210//MONO- AND DIACYLGLYCEROL LIPASE PRECURSOR (EC 3.1.1.-) (MDGL).//3.60E-06//94aa// 
34%//P25234 
CTONG201 33720 
CTONG201 65590 

30 CTONG20165750//SON PROTEIN (SON3).//4.40E-239//427aa//99%//P18583 
CTONG201 66580 
CTONG201 67750 
CTONG201 68240 
CTONG201 68460 

35 CTONG20169040//KERATIN, TYPE I CYTOSKELETAL 15 (CYTOKERATIN 15) (K15) (CK 15).//6.00E-112//223aa// 
99%//P19012 
CTONG201 69530 

CTONG20170940//MYOTROPHIN (V-1 PROTEIN) (GRANULE CELL DIFFERENTIATION PROTEIN). //4.60E-10// 
93aa//36%//P80144 
40 CTONG20174290//TRICHOHYALIN.//1.30E-07//340aa//21%//P37709 
CTONG201 74440 

CTONG20174580//Homo sapiens mRNA for vascular Rab-GAP/TBC-containing protein complete cds.//1 .00E-1 15// 
335aa//61 %//NP_008994 

CTONG20176040//ADP-RIBOSYLATION FACTOR-LIKE PROTEIN 3 (ARD3)7/1 .70E-34//155aa//43%//P37996 
45 CTONG20 179390 

CTONG201 79890 

CTONG201 79980 

CTONG201 80620 

CTONG201 80690 
50 CTONG20181350 

CTONG20183430//ANKYRIN 2 (BRAIN ANKYRIN) (ANKYRIN B) (ANKYRIN, NONERYTHROID).//4.60E-30//311aa// 
32%//Q01484 

CTONG20183830//IRLB [Homo sapiens]//1 .50E-104//191aa//100%//CAA45013 
CTONG20184130 

55 CTONG20184830//ATP-BINDING CASSETTE. SUB-FAMILY A, MEMBER 1 (ATP-BINDING CASSETTE TRANS- 
PORTER 1) (ATP-BINDING CASSETTE 1).//1 .30E-63//271 aa//47%//P41233 
CTONG201 86140 

CTONG201 86290//ALDEHYDE DEHYDROGENASE, DIMERIC NADP-PREFERRING (EC 1.2.1.5) (ALDH CLASS 
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3).//1 .50E-74//144aa//100%//P30838 

CTONG20186370//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//2.60E-52//324aa//33%//P51523 
CTONG20186520//ZINC FINGER PROTEIN MFG-37/1 .40E-197//643aa//53%//P1 6374 
CTONG20186550//cca3 protein - rat //2.l0E-37//141aa//56%//T31081 
5 CTONG201 88080 

CTONG20189000//PROBABLE GUANINE NUCLEOTIDE REGULATORY PROTEIN TIM (ONCOGENE TIM) (P60 
TIM) (TRANSFORMING IMMORTALIZED MAMMARY ONCOGEN E).//1 .10E-48//222aa//50%//Q1 2774 
CTONG20190290//39.1 KDA PROTEIN IN SURE-CYSC INTERGENIC REGION.//6.00E-15//132aa//31%//Q57261 
CTONG201 90630 

10 DFNES20016470//Homo sapiens SDP1 protein mRNA, complete cds.//4.90E-33//95aa//37%//AF076957 

D FN ES20018000//CADH ERIN-RELATED TUMOR SUPPRESSOR PRECURSOR (FAT PROTElN).//8.70E-23// 
306aa//30%//P33450 

DFNES20025500//Homo sapiens mRNA for paraplegin-like protein.//3.00E-29//68aa//94%//Y18314 
DFN ES200281 70//Mouse mRNA for RN A polymerase I associated factor (PAF53), complete cds.//1 .40E-1 65//393aa// 
15 78%//D 14336 

DFNES20029660 
DFNES20032550 
DFNES20043710 

DFNES20046840//FORKHEAD BOX PROTEIN E1 (FORKHEAD-RELATED PROTEIN FKHL15) (THYROID TRAN- 
20 SCRIPTION FACTOR 2) (TTF-2).//6.20E-05//151 aa//32%//000358 

DFNES20055400//Homo sapiens diphthamide biosynthesis protein-2 (DPH2) mRNA, complete cds.//5.40E-203// 
41 3aa//91%//AF053003 

DFNES20057660//GRAVE'S DISEASE CARRIER PROTEIN (GDC) (MITOCHONDRIAL SOLUTE CARRIER PRO- 
TEIN HO MO LOG) J/7. 00 E-3 1 //247aa//33%//Q0 1888 
25 DFNES20063460//PAB-DEPENDENT POLY(A)-SPECIFIC RIBONUCLEASE SUBUNIT PAN3 (EC 3.1.13.4) 
(PAB1 P-DEPENDENT POLY(A)-NUCLEASE).//1 .90E-23//115aa//43%//P361 02 

D FN ES20072990//H YPOTH ETICAL 46.7 KDA PROTEIN IN HOR7-COX7 INTERGENIC REGION.//1 .80E-22//31 Oaa// 
25%//Q04835 

DFNES20073320//Mus musculus RING-finger protein MURF mRNA, complete cds.//6.70E-118//362aa//61%// 
30 AF294790 

DFNES20076340 

DFNES20080880//POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE (EC 2.4.1.41) (PROTEIN- UDP 
ACETYLGALACTOSAMINYLTRANSFERASE) (UDP-GALNAC: POLYPEPTIDE, N-ACETYLGALACTOSAMINYL- 
TRANSFERASE) (GALNAC-T1 )V/2.00E-1 04//486aa//43%//Q1 0472 
35 DFNES20088810 

DFNES20094820//coronin-like protein [Schizosaccharomyces pombe]//3.60E-20//333aa//24%//CAB11184 
FCBBF10000230//H. sapiens mRNA from TYL gene.//3.30E-155//650aa//51%//X99688 
FCBBF1 0002200 

FCBBF10004760//Homo sapiens GAP-like protein (N61) mRNA, complete cds.//9.10E-82//412aa//44%//AF251038 
40 FCBBF2001 8680//RABPHILIN-3A.//1 .70E-1 6//262aa%/30%//P47709 
FCBBF20020440 
FCBBF20021110 

FCBBF20023490//PUTATIVE PRE-MRNA SPLICING FACTOR ATP-DEPENDENT RNA HELICASE 
SPAC1 0F6.02C.//1 .20E-90//505aa//39%//042643 
45 FCBBF20028980 
FCBBF20029280 
FCBBF20032930 

FCBBF20033360//RING CANAL PROTEIN (KELCH PROTEIN).//7.40E-33//234aa//32%//Q04652 

FCBBF20035430//Mus musculus arsenite inducible RNA associated protein (Airap) mRNA, complete cds.//1 .50E-51// 
50 152aa//57%//AF224494 

FCBBF20035490//GAP-associated tyrosine phosphoprotein p62 (Sam68) [Homo sapiens] >pirllA38219 GAP-associ- 

ated tyrosine phosphoprotein p62//1 .50E-214//415aa//93%//NP_006550 

FCBBF20036360 

FCBBF20038230 
55 FCBBF20038950 

FCBBF20041 380 

FCBBF20043730 

FCBBF20054390 
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FCBBF20056580//Mus musculus NSD1 protein mRNA, complete cds.//3.40E-304//773aa//75%//AF064553 

FCBBF20059660 

FCBBF20061310 

FCBBF20066340//Homo sapiens nuclear dual -specificity phosphatase (SBF1) mRNA, partial cds.//1 ,20E-68//312aa// 
5 49%//U93181 

FCBBF20070800 

FCBBF20070950//MICRONUCLEAR LINKER HISTONE POLYPROTEIN (MIC LH) [CONTAINS: LINKER HfSTONE 
PROTEINS ALPHA, BETA, DELTA AND GAMMA].//5.00E-10//601aa//20%//P40631 
FCBBF30000010 
10 FCBBF30001020 

FCBBF30001100//CRAG protein [Drosophita melanogaster]//7.40E-185//800aa//46%//CAA76938 
FCBBF30001150 

FCBBF30002270//HISTONE H1' (H1.0) (H1 (0)).//4.90E-62//154aa//84%//P07305 

FCBBF30002280//THIOREDOXIN PEROXIDASE 2 (THIOREDOXIN-DEPENDENT PEROXIDE REDUCTASE 2) 
15 (PROLIFERATION-ASSOCIATED PROTEIN PAG) (NATURAL KILLER CELL ENHANCING FACTOR A) (NKEF-A).// 
1 .20E-27//61 aa//98%//Q06830 
FCBBF30002330 

FCBBF30003810//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//9.10E-93//313aa//53%// 
Q05481 

20 FCBBF30004340//Homo sapiens GalNAc-T9 mRNA for UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, 
complete cds.//1 .60E-109//299aa//63%//AB040672 
FCBBF30004730 
FCBBF30005180 

FCBBF30005360//Mus musculus spermatogenesis associated factor (SPAF) mRNA, complete cds.//0//894aa//84%// 
25 AF049099 

FCBBF30005500//HYPOTHETICAL PROTEIN KIAA0167.//5.80E-16//124aa//36%//Q99490 

FCBBF30019140//CHROMODOMAIN HELICASE-DNA-BINDING PROTEIN 3 (CHD-3) (MI-2 AUTO ANTIGEN 240 
KDA PROTEIN) (MI2-ALPHA).//0//725aa//82%//Q12873 

FCBBF30019180//SERINE/THREONINE PROTEIN PHOSPHATASE 2A, 65 KDA REGULATORY SUBUNIT A, AL- 
30 PHA ISOFORM (PP2A, SUBUNIT A, PR65-ALPHA ISOFORM) (PP2A, SUBUNIT A, R1 -ALPHA ISOFORM).//4.60E- 
233//451 aa//98%//P5461 2 
FCBBF30019240 

FCBBF30021900//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//4.10E-161//633aa//48%// 
Q05481 

35 FCBBF30022680//putative 5'-3' exonuclease//9.00E-12//200aa//25%//AAG29662 

FCBBF30026580//Homo sapiens retinoblastoma-associated protein RAP140 mRNA, complete cds.//7.60E-27// 
367aa//28%//AF1 80425 

FC BBF30029250//GLU COAM YLAS E S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//4.00E-1 8//754aa//23%//P 08640 
40 FCBBF30035570 

FCBBF30042610//Homo sapiens CTL2 gene.//2.10E-137//393aa//60%//AJ245621 
FCBBF30048420//TBX19 PROTEIN (T-BOX PROTEIN 1 9).//1 .40E-94//212aa//85%//060806 

FCBSF30053300//Human autoantigen pericentriol material 1 (PCM-1) mRNA, complete cds.//0//708aa//90%//L27841 
FCBBF30056980 

45 FCBBF30062490//Mus musculus prominin-like protein mRNA, partial cds.//7.70E-85//210aa//79%//AF 12811 3 
FCBBF30063990 
FCBBF30068210 

FCBBF30071500//Homo sapiens dentin phosphoryn mRNA, complete cds.//2.80E-09//675aa//22%//AF094508 
FCBBF30072440//Homo sapiens SARDH mRNA, alternatively spliced, complete cds.//1 .70E-14//81aa//53%// 
50 AF095737 

FCBBF30072480 
FCBBF30074530 
FCBBF30074620 
FCBBF30075970 

55 FCBBF3007631 0//CAMP-DE PENDENT PROTEIN KINASE, BETA- CATALYTIC SUBUNIT (EC 2.7.1.37) (PKA C-BE- 
TA).//8.20E-1 66//240aa//1 00%//P22694 
FCBBF30078600 
FCBBF30079770 
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FCBBF30080730//SPLICING FACTOR, ARGININE/SERINE-RICH 7 (SPLICING FACTOR 9G8).//3.40E-70//136aa// 

95%//Q 16629 

FCBBF30081000 

FCBBF30085560//HYPOTH ETICAL 60.3 KDA PROTEIN R13G10.2 IN CHROMOSOME III.//1 .10E-87//531aa//38%// 
5 Q21988 

FCBBF30088700 
FCBBF30089380 
FCBBF30091010 

FCBBF30091 520//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
10 (1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//3.70E-09//631 aa//21 %//P08640 

FCBBF30093170//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//2.1 0E-63//173aa//65%//P51523 

FCBBF30095410 

FCBBF30099490 

FCBBF30100080//ARF NUCLEOTIDE-BINDING SITE OPENER (ARNO PROTEIN) (ARF EXCHANGE FACTOR).// 
15 1.10 E-57//1 08aa//1 00%//Q994 1 8 

FCBBF30100120//Mus musculus semaphorln cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete 
cds.//2. 1 0E-1 92%/769aa//50%//AF1 27084 

FCBBF30100410//Mus musculus testis-specific Y-encoded-like protein (Tspyll) mRNA, complete cds.//1 .90E-56// 
324aa//42%//AF0421 80 
20 FCBBF30101240 
FCBBF30101300 
FCBBF301 05080 

FCBBF30105440//Rattus norvegicus ion transporter protein (NRITP) mRNA, partial cds.//3.40E-36//82aa//91%// 
AF1 84921 

25 FCBBF30105860//microtubute associated-protein orbit [Drosophila melanogaster].//1 .00E-79//556aa//33%// 
BAA94248 
FCBBF301 06950 

FCBBF30107290//MITOCHONDRIAL PROCESSING PEPTIDASE ALPHA SUBUNIT PRECURSOR (EC 3. 4. 24. 64) 
(ALPHA-MPP) (P-55) (HA1523) (K I AA01 23)7/1 .00E-91//1 72aa//100%//Q10713 
30 FCBBF301 07330 
FCBBF30114180 

FCBBF30114850//Homo sapiens C2H2 (Kruppel-type) zinc finger protein mRNA, complete cds.//3.20E-24//249aa// 
34%//AF1 59567 
FCBBF301 15230 

35 FCBBF30115920//Homo sapiens nolp mRNA, complete cds.//9.40E-220//257aa//100%//AB0 17800 

FCBBF30118670//Homo sapiens disintegrin and metalloproteinase domain 19 (ADAM 19) mRNA, partial cds.//0// 
601 aa//97%//AF1 34707 

FCBBF30118890//Drosophila melanogaster La related protein (larp) mRNA, partial cds.//6.70E-25//221aa//35%// 
AF221108 
40 FCBBF301 25460 

FCBBF30125880//Homo sapiens single-strand selective monofunctional uracil DNA glycosylase mRNA, complete 
cds.//6.40E-81//96aa//1 00%//AF1 251 82 
FCBBF301 28420 

FCBBF301 2901 0//ZINC FINGER PROTEIN 36 (ZINC FINGER PROTEIN KOX1 8) (FRAGMENT).//1 .20E-1 79//322aa// 
45 100%//P 17029 

FCBBF30130410//CALDESMON (CDM).//3.30E-06//170aa//32%//P 12957 
FCBBF301 30580 

FCBBF30132050//Homo sapiens mRNA for UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltrans- 
ferase.//2.1 0E-43//253aa//36%//Y1 501 4 
50 FCBBF30132660//Drosophila melanogaster Canton S tartan protein (trn) mRNA, complete cds.//2.00E-15//293aa// 
30%//U02078 

FCBBF301 35890//GLUTENIN, LOW MOLECULAR WEIGHT SUBUNIT PRECURSOR.//2.60E-07//163aa//34%// 
P 10385 

FCBBF30136230//NIL-2-AZINC FINGER PROTEIN (NEGATIVE REGULATOR OF IL2) (TRANSCRIPTION FACTOR 
55 8).//0//1 090aa//94%//P37275 

FCBBF30138000//trg protein - rat//1 .30 E-82//560aa//37%//l 60486 

FCBBF30142290//dJ127B20.3 (novel PHD finger protein) [Homo Sapiens].//1 .00E-140//287aa//96%//CAB62994 
FCBBF30143550//FYVE FINGER-CONTAINING PHOSPHOINOSITIDE KINASE (EC 2.7. 1.68) (1-PHOSPHATIDYLI- 



188 



EP 1 308 459 A2 



NOSITOL-4-PHOSPHATE KINASE) (PIP5K) (PTDINS(4)P-5- KINASE) (P235).//0//1027aa//91 %//Q9Z1T6 

FCBBF301 45670 

FCBBF30151190 

FCBBF301531 70//6-PHOSPHOFRUCTOKINASE, LIVER TYPE (EC 2.7.1.11) (PHOSPHOFRUCTOKINASE 1) 
5 (PHOSPHOHEXOKINASE) (PHOSPHOFRUCTO-1 -KINASE ISOZYME B).//0//670aa//99%//P17858 

FCBBF30157270//Rattus norvegicus PAPIN mRNA, complete cds.//1 .20 E-179//639aa//58%//AF1 69411 
FCBBF30161780 

FCBBF301 6451 0//RETINAL-CADHERIN PRECURSOR (R-CADHERIN) (R-CAD).//0//794aa//98%//P55283 
FCBBF30166220//SERINE HYDROXYMETHYLTRANSFERASE, CYTOSOLIC (EC 2.1. 2.1) (SERINE METHYLASE) 
10 (GLYCINE HYDROXYMETHYLTRANSFERASE) (SHMT).//2.70E-12//33aa//1 00%//P34896 

FCBBF30169280//Petunia x hybrida PGPD14 (PGPD14) mRNA, complete cds.//1 .40E-62//261aa//42%//AF049930 
FCBBF301 69870 
FCBBF301 70710 

FCBBF30171230//NEUROENDOCRINE CONVERTASE 2 PRECURSOR (EC 3.4.21 .94) (NEC 2) (PC2) (PROHOR- 
15 MONE CONVERTASE 2) (PROPROTEIN CONVERTASE 2) (KEX2-LIKE ENDOPROTEASE 2)7/1 .70E-82//181aa// 
86%//P16519 
FCBBF301 72330 

FCBBF30173960//erythroid differentiation-related factor 1 [Homo sap i ens] 7/6. 00 E-32//113aa//100%//AAC 00001 
FCBBF301 75350//MITOG EN-ACTIVATED PROTEIN KINASE 7 (EC 2.7.1 .-) (EXTRACELLULAR SIGNAL- REGULAT- 
20 ED KINASE 5) (ERK5) (ERK4) (BMK1 KINASE).//4.60E-06//245aa//28%//Q131 64 

FCBBF30177290//HYPOTHETICAL47.6 KDA PROTEIN C16C10.5 IN CHROMOSOME III.//2.40E-28//1 33aa//46%// 
Q09251 

FCBBF30179180 
FCBBF301 79740 
25 FCBBF30181730 
FCBBF301 94370 

FCBBF30194550//ANKYRIN 1 (ERYTHROCYTE ANKYRIN) (ANKYRIN R) (ANKYRINS 2.1 AND 2.2).//9.90E-54// 
478aa//33%//P16157 

FCBBF301 95690//SYNAPTOTAGMIN 17/1 .30E-27//138aa//31%//P34693 
30 FCBBF301 95700 

FCBBF301 97840//Mouse mRNA for seizure-related gene product 6 type 2 precursor, complete cds7/0//869aa//91 %// 
D64009 

FCBBF30198670//dof protein - fruit fly (Drosophila melanogaster)//6.60E-05//272aa//24%//T13712 
FCBBF30201 630//Chlamydomonas reinhardtii dhd gene for 1 -alpha dynein heavy chain.//1 .10E-121//384aa//5896// 
35 AJ243806 

FCBBF30212210 

FCBBF30215240//mitogen inducible gene mig-2 - human//2.20E-135//263aa//96%//S69890 

FCBBF30220050//OXYSTEROLS RECEPTOR LXR-BETA (LIVER X RECEPTOR BETA) (NUCLEAR ORPHAN RE- 
CEPTOR LXR-BETA) (UBIQUITOUSLY-EXPRESSED NUCLEAR RECEPTOR) (NUCLEAR RECEPTOR NER).// 
40 6.90 E-96//1 67aa//1 00%//P55055 

FCBBF30222910//Mus musculus Rap2 interacting protein 8 (RPIP8) mRNA, complete cds.//4.70E-29//76aa//46%// 
U73941 

FCBBF30223110 

FCBBF30223210//PLEXIN 4 PRECURSOR (TRANSMEMBRANE PROTEIN SEX)7/1 .70E-72//179aa//76%//P51805 
« FCBBF30225930 

FCBBF30228940//Homo sapiens zinc finger protein dp mRNA, complete cds.//2.60E-14//114aa//42%//AF 153201 
FCBBF30230610 

FCBBF30236670//Homo sapiens DEAD-box protein abstrakt (ABS) mRNA, complete cds.//1 .00E-128//276aa//91%// 
AF195417 

50 FCBBF30250980//GUANINE NUCLEOTIDE-BINDING PROTEIN BETA SUBUNIT-LIKE PROTEIN7/3.20E-06// 
190aa//28%//024076 

FCBBF30255680//Rattus norvegicus brain specific cortactin-binding protein CBP90 mRNA, partial cds.//1 .80E-275// 
641 aa//82%//AF053768 

FCBBF30257370//CARNITINE DEFICIENCY-ASSOCIATED PROTEIN EXPRESSED IN VENTRICLE 1 (CDV-1 PRO- 
55 TEIN)7/2.80E-1 69//355aa//92%//035594 

FCBBF30259050//Mus musculus (clone pMLZ-1) zinc finger protein (Zfp) mRNA, 3' end of cds.//1 .40E-241//499aa// 
83%//L36315 

FCBBF30260210//Drosophila melanogaster KISMET-L long isoform (kis) mRNA, complete cds.//3.90E-178//420aa// 
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68%//AF215703 

FCBBF30260480//Mus musculus putative E1-E2 ATPase mRNA, partial cds.//1 .80E-78//154aa//95%//AF 156547 
FCBBF30263080//ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1).//4.20E-33//107aa//58%//P51522' 
FCBBF30266510 

5 FCBBF30271 990//ANKYRIN 1 (ERYTHROCYTE ANKYRIN).//3.00E-43//419aa//33%//Q02357 

FCBBF30275590//dedicator of cyto-kinesis 1 [Homo Sapiens]7/1 .OOE-138//791aa//37%//NP_001371 
FCBBF30282020//cca3 protein - rat//5.50E-249//492aa//94%//T31081 

FCBBF30285930//ZINC FINGER PROTEIN ZFP-1 (MKR1 PROTEIN).//5.70E-68//125aa//97%//P08042 
FCBBF30287940 

w FCBBF40000610//late gestation lung 2 protein [Rattus norvegicus]7/5.00E-86//178aa//94%//AAF44721 
FCBBF40001920 
FCBBF40005000 
FCBBF50000410 
FCBBF50000610 

15 FCBBF50001 650//Homo sapiens JP3 mRNA for junctophilin type3, complete cds.//1 .20E-1 1 1//407aa//57%//AB042636 
FCBBF50003530//H.sapiens mRNA for dinG gene.//2.70E-137//1 81 aa//100%//Y 10571 
FCBBF50004950 

FEBRA20005040//MYOSIN HEAVY CHAIN, STRIATED MUSCLE.//1 .90E-1 3//479aa//23%//P24733 
FEBRA20007820//MLN 64 PROTEIN (CAB1 PROTEIN).//7.00E-31//129aa//51%//Q14849 
20 FEBRA20018670 

FEBRA20026820//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//2.20E-135//431 aa//50%// 
Q05481 

FEBRA20027070//ZINC FINGER PROTEIN 41 ( FRAGMENT) 7/1 .00 E-139//333aa//70%//P51 81 4 
FEBRA20029620 

25 FEBRA20031 000//TRICHOHYALIN.//2.20E-1 6//360aa//26%//P37709 

FEBRA20031150//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds.//2.00E-29//63aa//1 00%//AF226053 

FEBRA20031280 

FEBRA20031810 

FEBRA20035200 

30 FEBRA20035240//SPLICEOSOME ASSOCIATED PROTEIN 49 (SAP 49) (SF3B53).//7.30E-05//108aa//31%// 
Q 15427 

FEBRA20038220//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 

(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//7.50E-05//256aa//24%//P08640 

FEBRA20038330 

35 FEBRA20038970//Homo sapiens mRNA for stabilin-1 (stabl gene).//1 .30E-242//413aa//99%//AJ275213 
FEBRA20039070 

FEBRA20039260//NonF [Streptomyces griseus subsp. griseus]y/2.20E-16//140aa//38%//AAD37457 
FEBRA20040230 
FEBRA20040260 
40 FEBRA20040290 

FEBRA20040560//Homo sapiens delta-6 fatty acid desaturase (CYB5RP) mRNA, complete cds.//1 .70E-112//204aa// 
100%//AF 134404 

FEBRA20045380//EVI-5 homolog [Homo sapiens] .//7.00E-49//130aa//81 %//AAC16031 

FEBRA20046200//ANKYRIN 2 (BRAIN ANKYRIN) (ANKYRIN B) (ANKYRIN, NONERYTHROID).//1 .70E-25//368aa// 
45 30%//Q01484 

FEBRA20046280 

FEBRA20046510//ZINC FINGER PROTEIN 135.//1 .10E-94//260aa//62%//P52742 
FEBRA20057010//ZINC FINGER PROTEIN 1 95.//1 .30E-12//47aa//70°///014628 

FEBRA20063720//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1).//3.10E-243//586aa//73%// 
50 Q03923 

FEBRA20076200 
FEBRA20078180 

FEBRA20078800//NADH-UBIQUINONE OXIDOREDUCTASE 20 KDA SUBUNIT PRECURSOR (EC 1.6.5.3) (EC 1. 
6. 99. 3) (COMPLEX I-20KD) (CI-20KD) (PSST SUBUNIT).//2.30E-96//192aa//96%//075251 
55 FEBRA20080860 
FEBRA20082660 
FEBRA20083410 
FEBRA20084750 
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FEBRA20086600 

FEBRA20087550//damage-specific DNA binding protein 2 (48kD) [Homo sapiens]//1.10E-106//119aa//94%// 
NP_000098 

FEBRA2008861 0//CELLULAR RETIN ALDEHYDE-BINDING PROTEIN (CRALBP).//2.40E-14//145aa//30%//P10123 
5 FEBRA20088810//FIBROBL GROWTH FACTOR-17 PRECURSOR (FGF-1 7).//1 .00E-1 02//1 93aa//99%//060258 

FEBRA200901 60//MITOG EN-ACTIVATED PROTEIN KINASE KINASE KINASE 10 (EC 2.7.1.-) (MIXED LINEAGE 
KINASE 2) (PROTEIN KINASE MST).//1 .10E-15//111aa//48%//Q02779 

FEBRA20090220//TRANSLATION INITIATION FACTOR EIF-2B EPSILON SUBUNIT (EIF-2B GDP-GTP EXCHANGE 
FACTO R) .//0//72 1 aa//90%//P47823 
w FEBRA20091620 

FEBRA20092760//PINCH PROTEIN ( PARTI CULARY INTERESTING NEW CYS-HIS PROTEIN).//2.50E-81//1 65aa// 
80%//P48059 
FEBRA20093270 
FEBRA20093280 
15 FEBRA20095410 
FEBRA20098040 

FEBRA20099860//dynactin 3 (p22); dynactin light chain [Homo Sapiens]//1 .70E-41//89aa//1 00%//NP„009165 
FEBRA20101410 
FEBRA201 08020 
20 FEBRA201 08580 
FEBRA201 15930 
FEBRA201 16650 
FEBRA20121200 

FEBRA20121950//X INACTIVE SPECIFIC TRANSCRIPT PROTEIN (FRAGMENT) V/1 .60E-07//155aa//24%//P27571 
25 FEBRA20141980 

FEBRA20150420//HYPOTHETICAL 131 .5 KDA PROTEIN C02F12.7 IN CHROMOSOME X.//6. 90 E-56//877aa//24%// 
Q11102 

FEBRA20151750//Mus musculus (clone E5.53) Huntington disease (hdh) gene, exon 5.//2.60E-12//88aa//43%// 
L34024 
30 FEBRA201 63980 

FEBRA20170240//ZINC FINGER PROTEIN 75.//7.90E-158//278aa//99%//P51815 

FEBRA20172230//Mus musculus schwannoma-associated protein (SAM9) mRNA, complete cds.//1 .70E-57//295aa// 
40%//AF026124 

FEBRA20173330//PROTEIN KINASE CLK3 (EC 2.7.1 .-)y/4.80E-277//490aa7/99%//P49761 
35 FEBRA201 75020 
FEBRA201 75330 

FEBRA20177800//RNA binding motif protein 9 [Homo sapiens]. //4.00E-09//75aa//95%//NP_055 124 
FEBRA201 80510 
FEBRA201 82030 
40 FEBRA201 87460 

FEBRA201 91 720//REGULATOR OF G-PROTEIN SIGNALING 11 (RGS1 1 ) 7/2.00 E-73//1 04aa//1 00%//09481 0 
HCHON1 00001 50//SKELETAL MUSCLE LIM-PROTEIN 1 (SLIM 1) (SLIM) (FOUR AND A HALF LIM DOMAINS PRO- 
TEIN 1) (FHL-1).//4.10E-74//154aa//84%//Q13642 
HCHON1 0001 660 

45 HCHON20000870//SERINE/THREONINE-PROTEIN KINASE CTR1 (EC 2.7.1 .37)7/2. 10 E-21//300aa//26%//Q 0560 9 
HCHON20002650//EARLY GROWTH RESPONSE PROTEIN 2 (EGR-2) (KROX-20 PROTEIN).//9.90E-05//1 66aa// 
26%//P51 774 

HCHON20002710//UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 13 (EC 3.1.2.15) (UBIQUITIN THIOLESTE- 
RASE 13) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 13) (DEUBIQUITINATING ENZYME 13).//7.40E-10// 
50 1 1 4aa//28%//P381 87 

HCHON20015050//LEUKOCYTE ADHESION GLYCOPROTEIN P1 50,95 ALPHA CHAIN PRECURSOR (LEUKO- 
CYTE ADHESION RECEPTOR P1 50,95) (CD11C) (LEU M5) (INTEGRIN ALP HA-X).//8. 60 E-06//250aa//26%//P 20702 
HEART10001420//Mus musculus skm-BOP1 (Bop) mRNA, complete cds.//6.10E-259//485aa//94%//U 76373 
HEART10001490//ACTIN INTERACTING PROTEIN 2.//1 .80E-71//243aa//58%//P46681 
55 HEART20009590//Homo sapiens mRNA for paraplegin-like protein.//7.10E-47//145aa//67%//Y1 8314 

HEART20019310//Mus musculus RING-finger protein MURF mRNA, complete cds.//6.70E-118//362aa//61%// 
AF294790 

HEART20022200//METHIONINE AMINOPEPTIDASE 2 (EC 3.4.11.18) (METAP 2) (PEPTIDASE M 2) (INITIATION 
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FACTOR 2 ASSOCIATED 67 KDA GLYCOPROTEIN) (P67).//1 .50E-209//447aa//86%//P50579 
HEART20031680 

HEART20047640//CALCIUM/CALMODULIN-DEPENDENT 3\5'-CYCLIC NUCLEOTIDE PHOSPHODIESTERASE 
1C (EC 3.1.4.17) (CAM-PDE 1C).//0//769aa//94%//Q 63421 
5 HEART20063100//H. sapiens mRNA histone RNA hairpin-binding protein.//5.60E-114/%212aa//100%//Z71188 

HEART20082570//AMINOMETHYLTRANSFERASE PRECURSOR (EC 2.1 .2.10) (GLYCINE CLEAVAGE SYSTEM T 

P ROTE I N) 7/5 .50 E- 1 1 3//2 1 0 aa//69%//P2 8337 

HHDPC10001140 

HHDPC20051850//STEROID RECEPTOR PROTEIN DG67/9.50E-43//101 aa//89%//015173 
10 HHDPC20081230//NUCL (PROTEIN C23).//0//681aa//92%//P19338 
HHDPC20082790 
HHDPC20082970 
HHDPC20088160 

HLUNG20008460//DIAPHANOUS PROTEIN HOMOLOG 2.//7.60E-33//521aa//26%//060879 
15 HLUNG20009260 
HLUNG20009550 
HLUNG20010130 

HLUNG20011260//TYROSINE-PROTEIN KINASE SRC-1 (EC 2.7.1.112) (P60-SRC-1).//1 .10E-46//92aa//100%// 
P13115 
20 HLUNG20011440 

HLUNG20011460//Rattus norvegicus serine-arginine-rich splicing regulatory protein SRRP86 mRNA, complete cds.// 

1 .20E-159//398aa//79%//AF234765 

HLUNG20012140 

HLUNG20014590//ZINC FINGER PROTEIN 135.//1 .20E-122//350aa//59%//P52742 
25 HLUNG20015070//SLIT PROTEIN PRECURSOR.//5.00E-1 4//1 67aa//33%//P24014 

HLUNG20015180//BALBIANI RING PROTEIN 3 PRECURSOR.//8.80E-08//444aa//24%//Q03376 
HLUNG20020500 

HLUNG20020850//TLM PROTEIN (TLM ONCOGENE)7/5.00E-1 7//91aa//54%//P1 7408 
HLUNG20021450 
30 HLUNG20023030 
HLUNG20024050 
HLUNG20025620 

HLUNG2002811 0//zinc finger protein - fission yeast (Schizosaccharomyces pombe).//2.70E-23//140aa//38%//T39456 
HLUNG20029420 
35 HLUNG20029490 

HLUNG20030420//Mus musculus mRNA for MAIL, complete cds.//1 .OOE-164//728aa//68%//AB020974 
HLUNG20030490//Ambystoma tigrinum RPE65 protein mRNA, complete cds.//1 .70E-64//335aa//42%//AF047465 
HLUNG20030610 
HLUNG20031620 

40 HLUNG20032460//LYSOSOMAL PRO-X CARBOXYPEPTIDASE PRECURSOR (EC 3.4.16.2) (PROLYLCARBOX- 
YPEPTIDASE) (PRCP) (PROLINE CARBOXYPEPTIDASE) (ANGIOTENSINASE C) (LYSOSOMAL CARBOX- 
YPEPTIDASE C) .//8 . 60 E - 274//440 aa//9 9%// P42 785 

HLUNG20033060//Homo sapiens GAP-like protein (N61) mRNA, complete cds.//1 .20E-81//389aa//45%//AF251038 
HLUNG20033310 
45 HLUNG20033350 
HLUNG20034970 
HLUNG20037140 

HLUNG20037160//RETROVIRUS-RELATED ENV POLYPROTEIN.//1 .90E-131//439aa//54%//P1 0267 
HLUNG20037780 
50 HLUNG20038330 

HLUNG20041540//GLUCO AMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//8.80E-08//286aa//24%//P08640 

HLUNG20041590//ubiquitous tetratricopeptide containing protein RoXaN [Homo sapiens].//1 .00E-158//737aa//42%// 
AAF05541 

55 H LU N G20 042 730//CYTOCH ROME P450 4A4 (EC 1.14.14.1) (CYPIVA4) (PROSTAGLANDIN OMEGA- HYDROXY- 
LASE) (P450-P-2) 7/4.90 E-1 26//442aa//49%//P 1 061 1 

HLUNG20045340//MOB2 PROTEIN (MPS1 BINDER 2) 7/4.60 E-27//135aa//37%//P43563 
HLUNG20047070 
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HLUNG20050760 
HLUNG20051330 

HLUNG20052300//AIG1 PROTEIN.//3.00E-23//21 6aa//30%//P54120 

HLUNG20054790//PHOSPHAT1DYL1NOSITOL 3-KINASE REGULATORY SUSUNIT (EC 2.7.1.137) (IB PI3-KINASE 
5 P101 SUBUNIT) (PTDINS-3-KINASE P101) (PI3K) (P1 01-PI3K).//8.30E-22//292aa//25%//002696 

HLUNG20055240 

HLUNG20056560 

HLUNG20057380 

HLUNG20059240 
10 HLUNG20060670 

HLUNG20063700//H .sapiens PEBP2aC1 acute myeloid leukaemia mRNA.//3.60E-22//62aa//79%//Z35278 
HLUNG20065700 

H LUNG20065990//SYNTAX IN 4.//8.80E-127//267aa//96%//Q1 2846 
HLUNG20067810 

15 HLUNG20068120//NUCLEAR TRANSITION PROTEIN 2 (TP-2).//7.1 0E-06//86aa//38%//P111 01 
HLUNG20069350//CALCYPHOSINE.//1.80E-13//128aa//31%//Q13938 
HLUNG20070410 

HLUNG20072100//Gallus gallus Dach2 protein (Dach2) mRNA, complete cds.//1 .40 E-236//404aa//79%//AF 198349 
HLUNG20072190 
20 HLUNG20072450 
HLUNG20074330 
HLUNG20079310 

HLUNG20081390//DNAJ PROTEIN7/1 .60E-1 7//98aa//47%//P35515 

HLUNG20081530//NEUROFILAMENT TRIPLET H PROTEIN (200 KDA NEUROFILAMENT PROTEIN) (NF-H).// 
25 1 .90E-09//220aa//25%//P1 2036 

HLUNG20082350//Homo sapiens goodpasture antigen-binding protein (COL4A3BP) mRNA, complete cds.//0//399aa// 
93%//AF 136450 

HLUNG20083330//alphal (III) collagen [Homo Sapiens]//5.40E-61//113aa//99%//CAA29886 

HLUNG20083480//Chicken mRNA for TSC-22 variant, complete cds, clone SLFEST52.//9.00E-178//527aa//68%// 
30 D82364 

HLUNG20083840 
HLUNG20083960 

H LUNG20084790//H Y POTH ETI C AL 65.2 KDA TRP-ASP REPEATS CONTAINING PROTEIN D2030.9 IN CHROMO- 
SOME l.//5.50E-47//161aa//53%//P90794 

35 HLUNG2008521 0//Homo sapiens intersectin 2 (SH3D1 B) mRNA, complete cds.//9.60E-28/1 62aa//95%//AF248540 
HLUNG20088750 
HLUNG20092530 
HLUNG20093030 
HLUNG20094130 

40 KIDNE20011600 

KIDNE20016360//Rattus norvegicus potassium channel (erg2) mRNA, complete cds7/0//418aa//96%//AF016192 

KIDNE20024380 

KIDNE20027980 

KIDNE20080690//PROBABLE AMINOTRANSFERASE T01 B11 .2 (EC 2.6.1 .-) . //5. 80 E- 11 4//445aa//48%//P9 1408 
45 KIDNE20081170//Homo sapiens microtubule-based motor (HsKIFC3) mRNA, complete cds.//6.20E-153//216aa// 
99%//AF004426 
KIDNE20083150 

KIDNE20083620//L-ASPARAGINASE (EC 3.5.1. 1) (L-ASPAR AGIN E AMI DO HYDROLASE) 7/2. 60E-53//148aa//43%// 
Q9ZSD6 
50 KIDNE20084030 

KIDNE20084040//PHOSPHOLIPASE D1 (EC 3.1 .4.4) (PLD 1) (CHOLINE PHOSPHATASE 1 ) (PHOSPHATIDYLCHO- 
LINE-HYDROLYZING PHOSPHOLIPASE D1).//3.30E-70//134aa//100%//Q 13393 

KIDNE20084730//Homo sapiens FH1/FH2 domain-containing protein FHOS (FHOS) mRNA, complete cds.//2.60E- 
1 48//599aa//52%//AF1 1 361 5 
55 KIDNE20084800 
KIDNE20086490 
KIDNE20086660 
KIDNE20086970 
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KIDNE20087880 

KIDNE20088240//atopy related autoantigen CALC [Homo sapiens].//1 .00E-26//300aa//26%//CAA76830 
KIDNE20089870//HISTONE ACETYLTRANSFE RASE TYPE BSUBUNIT2 (RETINOBLASTOMA BINDING PROTEIN 
P46) (RETINOBLASTOMA-BINDING PROTEIN 7).//4.30E-237//422aa//99%//Q16576 
5 KIDNE20091090 
KIDNE20094260 

KIDNE20094670//Drosophila melanogaster AAA family protein Bor (bor) mRNA, complete cds.//2.10E-124//399aa// 

59%//AF227209 

KIDNE20095530 

10 KIDNE20133460//Homo sapiens mRNA for sperm protein. //3. 40 E-146//284aa//1 00%//X91 879 
KIDNE201 33880 
KIDNE20134130 

KIDNE20134890//CENTROMERIC PROTEIN E (CENP-E PROTEIN).//7.70E-05//169aa//21%//Q02224 
K1DNE201 37310 
15 KIDNE201 38450 

KlDNE20140870//zinc finger protein 106 [Mus musculus]//2.10E-288//822aa//67%//AF060246 
KIDNE20141120 

KIDNE20141700//40S RIBOSOMAL PROTEIN S4, X ISOFORM (SINGLE COPY ABUNDANT MRNA PROTEIN) 
(SCR10)y/2.60E-72//153aa//89%//P12750 
20 KIDNE201 42680 

KIDNE20142900/JTHROMBOMODULIN PRECURSOR (FETOMODULIN) (TM) (CD141 ANTIGEN).//1 .80E-71// 
1 1 9aa//1 00%//P07204 
KIDNE20 143200 

KIDNE20147170//acetylglutamate synthase - fission yeast (Schizosaccharomyces pombe)//8.40E-15//143aa//37%// 
25 T40666 

KIDNE20148080 

KIDNE20149780//NG28 [Mus musculus]//3.50E-66//367aa//44%//AAC97966 

KIDNE20150730//REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 (RMSA-1 ).//2.40E-06//84aa//41%//P49646 
KIDNE20152440//Homo sapiens mRNA for serin protease with IGF-binding motif, complete cds.//1 .80E-181//388aa// 
30 93%//D87258 

KIDNE20154330//Rattus norvegicus mRNA for multi PDZ domain protein.//0//763aa//87%//AJ001320 

KIDNE20154830 

KIDNE20155980 

KIDNE20157100 

35 KIDNE20160360//ARF NUCLEOTIDE-BINDING SITE OPENER (ARNO PROTEIN) (ARF EXCHANGE FACTOR).// 
7. 1 0E-40//1 94aa//41 %//Q9941 8 
KIDNE201 60960 
KIDNE201 63710 

KIDNE20165390//Homo sapiens mRNA for beta-tubulin folding cofactor D.//0//709aa//94%//AJ006417 
40 KIDNE20169180//UROMODULIN PRECURSOR (TAMM-HORSFALL URINARY GLYCOPROTEIN) (THP).//0// 
615aa//99%//P07911 
KIDNE201 70400 

KIDNE20173150//Bos taurus mRNA for mitochondrial aralkyl acylCoA:amino acid N -acyltransf erase. //2. 90 E-53// 
277aa//40%//AJ22330 1 

45 KIDNE20173430//Homo sapiens PDZ domain containing-protein (PDZK1) mRNA, complete cds.//7.90E-28//150aa// 
34%//AF012281 
KIDNE201 76030 
KIDNE20181670 
KIDNE201 82540 

50 KIDN E201 861 70//UDP-GLUCURONOSYLTRANSFERASE 2B13 PRECURSOR, MICROSOMAL (EC 2.4.1.17) (UD- 
PGT) (EGT1 0)7/4 .40E-38//21 4aa//39%//P3651 2 
KIDN E201 88630 

KIDN E201 89890//Homo sapiens mRNA for KARP-1 -binding protein 2 (KAB2), complete cds.//6.00E-30//1 77aa//44%// 
AB022658 

55 KIDN E201 89960//TREHALASE PRECURSOR (EC 3.2.1.28) (ALPHA, ALPHA-TREHALASE) (ALPHA, ALPHA-TRE- 
HALOSE GLUCOHYDROLASE)y/1 .40E-224//421 aa//97%//043280 
KIDNE20191870 

LIVER20006260//Mus musculus zinc finger protein ZFP113 mRNA, complete cds.//4.50E-183//385aa//8596// 
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AF1 67320 
LIVER20007690 
LIVER20007750 
LIVER20010510 

5 LIVER20010760//Homo sapiens C-type lectin-like receptor- 1 mRNA, complete cds.//3, 10E-134//208aa//100%// 
AF200949 

LIVER20010990//Rattus norvegicus mRNA for putative integral membrane transport protein (UST1 r).//7.00E-52// 
1 96aa//54%//Y09945 

LIVER20011640//Human proline rich calmodulin-dependent protein kinase mRNA, complete cds.//2.00E-116//221 aa// 
w 97%//U23460 

LIVER20013890 

LIVER20026440//CYTOCHROME P450 4F3 (EC 1.14.13.30) (CYPIVF3) (LEUKOTRIENE-B4 OMEGA- HYDROXY- 
LASE) (LEUKOTRIENE-B4 20-MONOOXYGENASE) (CYTOCHROME P450- LTB-OMEGA).//2.60E-136//295aa// 
84%//Q08477 

is LIVER20030650//WHITE PROTEIN.//7.20E-09//229aa//25%//Q05360 
LIVER20032340 

LIVER20038000//MITOCHONDRIAL CARNITINE/ACYLCARNITINE CARRIER PROTEIN (CARN ITI NE/ACYLCAR- 
NITINE TRANSLOCASE) (CAC).//9.40E-40//148aa//38%//P97521 

LIVER20040740//RETINAL-BINDING PROTEIN (RALBP).//3.10E-60//337aa//37%//P49193 
20 LIVER20055270//SELENIDE, WATER DIKINASE 2 (EC 2.7.9.3) (SELENOPHOSPHATE SYNTHETASE 2) (SELENI- 
UM DONOR PROTEIN 2).//2.70E-204//376aa//97%//Q99611 

MESAN20006200//ANNEXIN II (LIPOCORTIN II) (CALPACTIN I HEAVY CHAIN) (CHROMOBINDIN 8) (P36) (PRO- 
TEIN I) (PLACENTAL ANTICOAGULANT PROTEIN IV) (PAP-IV)V/1 .70E-84//174aa//95%//P07355 
MESAN20007110 
25 MESAN20008150 
MESAN20008940 

MESAN20009090//Homo sapiens CEGP1 protein (CEGP1), mRNA//1 .1 0E-1 79//553aa//58%//NM_020974 
MESAN20016270//ZINC FINGER PROTEIN 37A (ZINC FINGER PROTEIN KOX21) (FRAGMENT).//1 .60E-141// 
242aa//1 00%//P1 7032 

30 MESAN20021130//Human SH3 domain-containing proline-rich kinase (sprk) mRNA, complete cds.//8.20E-168// 
346aa//91%//U07747 

MESAN20021220//INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN COMPLEX ACID LABILE CHAIN PRE- 
CURSOR (ALS).//7.40E-26//1 97aa//30%//002833 

MESAN20021470//SPINDLIN (30000 MR METAPHASE COMPLEX) (SSEC P).//3.50E-123//229aa//98%//Q611 42 
35 MESAN20021860 
MESAN20026870 

MESAN20027240//Rho guanine nucleotide exchange factor (GEF) 10 [Homo sapiens].//1 .00E-134//620aa//40%// 
NP_055444 

M ESAN20027900//COLLAG EN ALPHA 3(VI) CHAIN PRECURSOR//0//1001aa//98%//P12111 
40 MESAN20029780 

MESAN20030350//Mus musculus diaphanous-related formin (Dia2) mRNA, complete cds.//6.60E-301//669aa//84%// 

AF094519 

MESAN20030370 

MESAN20030390 

45 MESAN20033220//ALDEHYDE DEHYDROGENASE 7 (EC 1 .2.1 .5).//1 .60E-24//54aa//1 00%//P43353 

MESAN20034440//39.1 KDA PROTEIN IN SURE-CYSC INTERGENIC REGION.//2.70E-07//11 7aa//31%//Q57261 
MESAN20038520//DNA-DIRECTED RNA POLYMERASE III 128 KDA POLYPEPTIDE (EC 2.7.7.6) (RNA POLYMER- 
ASE III SUBUNIT 2)y/0//831aa//70%//P25167 
MESAN20041380 

50 MESAN20045750 

MESAN20056890//SPLICING FACTOR, ARGININE/SERINE-RICH 2 (SPLICING FACTOR SC35) (SC-35) (SPLICING 
COMPONENT, 35 KDA) (PR264 PROTEIN).//3.30E-12//97aa//48%//Q01130 

MESAN20057240//DNA EXCISION REPAIR PROTEIN ERCC-1 .//5.90E-120//195aa//9896//P07992 
MESAN20058110//65 KDA FK506-BINDING PROTEIN PRECURSOR (EC 5.2.1.8) (FKBP65) (FKBPRP) (PEPTI- 
55 DYL-PROLYL CIS-TRANS ISOMERASE) (PPIASE) (ROTAMASE) (IMMUNOPHILIN FKBP65).//8.00E-117//229aa// 
89%//Q61576 

MESAN20059570//Rattus norvegicus mRNA for seven transmembrane receptor, complete cds.//1.00E-173//484aa// 
63%//BAA82518 
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MESAN20060220 
MESAN20060430 

MESAN20065990//Human protein serine/threonine kinase stk2 mRNA, complete cds.//2.40E-07//65aa//50%//L20321 
MESAN20067430//TROPOMYOSIN, FIBROBLAST ISOFORM TM3.//1 .80E-39//87aa//100%//P09494 
5 MESAN20069530//LIM domain only 7 isoform c [Homo Sapiens]//2.20E-286//545aa//99%//NP_056667 

MESAN20084150//Mus musculus secretory carrier membrane protein 4 mRNA, complete cds.//2.20E-48//128aa// 

72%//AF224721 

MESAN20085360 

MESAN20089260 

10 MESAN20090190//CEGP1 protein [Homo sapiens].//0//880aa//57%//NP_066025 
MESAN20094180 
MESAN20095220 

MESAN20095800//PEPTIDYL- PROLYL CIS-TRANS ISOMERASE 1 0 (EC 5.2.1 .8) (PPIASE) (ROTAMASE) (CYCLO- 
PHILIN-10).//1.60E-31//150aa//46%//P52017 
15 NESOP20004520//LYMPHOCYTE-SPECIFIC PROTEIN LSP1 (PP52 PROTEIN) (52 KDA PHOSPHOPROTEIN) 
(LYMPHOCYTE-SPECIFIC ANTIGEN WP34).//3.40E-173//321aa//99%//P33241 
NESOP20005040 
NT2NE20018740 

NT2NE20018890//Homo sapiens WD-repeat protein 6 (WDR6) mRNA, complete cds.//6.60E-1 84//257aa//99%// 
20 AF099100 

NT2NE20021 860//Lytechinus variegatus embryonic blastocoelar extracellular matrix protein precursor (ECM3) mRNA, 
complete cds.//9.00E-68//466aa//36%//AF287478 

NT2NE20026200//TRANSKETOLASE (EC 2.2.1.1) (TK).//1 .80E-160//310aa//99%//P29401 
NT2NE20026510//basic protein, cytosolic - fruit fly (Drosophila melanogaster)//6.10E-35//202aa//41%//S47857 
25 NT2NE20028700 
NT2NE20033150 

NT2NE20037050//U2 SMALL NUCLEAR RIBONUCLEOPROTEIN AUXILIARY FACTOR 35 KDA SUBUNIT RELAT- 
ED-PROTEIN 2.//7.20E-08//1 09aa//38%//Q1 5696 

NT2NE20038870//ZINC FINGER X-LINKED PROTEIN ZXDA ( FRAGMENT) J/5. 90 E-153//405aa//72%//P 98 168 
30 NT2NE20039210 

NT2NE20042550//ADENYLATE KINASE, CHLOROPLAST (EC 2.7.4.3) (ATP-AMP TRANSPHOSPHORYLASE).// 
7.80E-15//153aa//28%//P431 88 
NT2NE20045190 
NT2NE20047870 
35 NT2NE20053230 

NT2NE20053950//ZINC FINGER PROTEIN 136.//6.40E-108//284aa//64%//P52737 
NT2NE20059210 

NT2NE20059680//Homo sapiens integrin cytoplasmic domain associated protein (lcap-1a) mRNA, complete cds.// 
1 .80E-44//96aa//1 00%//AF01 2023 
40 NT2NE20060750//ZINC FINGER PROTEIN ZFP-36 (FRAGMENT) 7/1 .20E-69//198aa//68%//P16415 
NT2NE20061030//ZINC FINGER PROTEIN 165.//3.00E-39//125aa//65%//P49910 
NT2NE20062880 

NT2NE20064780//A-AGGLUTININ ATTACHMENT SUBUNIT P R EC U RSO R 7/4 . 1 0 E- 05//443aa//24%//P32323 
NT2NE20066590 
45 NT2NE20069580 
NT2NE20070520 
NT2NE20073650 

NT2NE20077250//Homo sapiens cell cycle progression 2 protein (CPR2) mRNA, complete cds.//1.20E-173//349aa// 
94%//AF011792 
50 NT2NE20077270 
NT2NE20077860 

NT2NE20079670//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//2.60E-98//345aa//47%//P51523 

NT2NE20080770 

NT2NE20082130 

55 NT2NE20082600//Homo sapiens zinc finger protein dp mRNA, complete cds.//2.10E-19//163aa//42%//AF 153201 
NT2NE20086070 

NT2NE20087270//Homo sapiens putative RNA binding protein mRNA, alternatively spliced, complete cds.//4.30E-14// 
221aa//29%//AF119121 
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NT2NE20087850//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN CEX) (FRAGMENT) .80E-08// 

75aa//40%//P40603 

NT2NE20088030 

NT2NE20092950 

5 NT2NE20095230//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds.//1 .40E-09//112aa//32%//AF226053 
NT2NE201 04000 
NT2NE201 07810 

NT2NE20108420//KES1 PROTEIN.//4.70E-25//312aa//31%//P35844 
NT2NE20111190//C-TERMINAL BINDING PROTEIN 2.//9.00E-54//137aa//84%//P56545 
10 NT2NE201 12210 
NT2NE201 14850 

NT2NE20117580//NADH-UBIQUINONE OXIDOREDUCTASE 20 KDA SUBUNIT PRECURSOR (EC 1.6.5.3) (EC 
1.6.99.3) (COMPLEX I-20KD) (CI-20KD) (PSST SUBUNIT).//1 .20E-76//153aa//9596//075251 

NT2NE20119980//TUMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 1, ENDOTHELIAL (B12 PROTEIN).// 
15 6.1 0E-48//1 35aa//74%//Q1 3829 
NT2NE201 23610 
NT2NE201 24570 
NT2NE201 26030 

NT2NE201 27900//Homo sapiens myoinositol 1 -phosphate synthase A1 (ISYNA1 ) mRNA, complete cds.//1 .40E-269// 
20 465aa//99%//AF220530 

NT2NE20140130//SEMAPHORIN 3B PRECURSOR (SEMAPHORIN V) (SEMA V).//1 .90E-42//90aa//100%//Q13214 
NT2NE201 40280 

NT2NE20141040//DOWN SYNDROME CRITICAL REGION PROTEIN 1 .//1 .80E-105//197aa//99%//P53805 
NT2NE20145250//SCG10 PROTEIN (SUPERIOR CERVICAL GANGLION-10 PROTEIN).//2.50E-58//141 aa//88%// 
25 P55821 

NT2NE20146510//HYPOTH ETICAL 104.7 KDA PROTEIN F23F12.8 IN CHROMOSOME III PRECURSOR.//6.20E- 
08//1 79aa//23%//P46504 
NT2NE201 48690 
NT2NE20 149500 
30 NT2NE201 50610 
NT2NE201 52620 

NT2NE20153620//mitogen inducible gene mig-2 - h uman//7. 90 E-147//449aa//63%//S 69890 
NT2NE201 55650//RETROV I RUS-R ELATED ENV POLYPROTEIN.//2.30E-29//114aa//33%//P 10267 
NT2NE20157120 
35 NT2NE20165190 

NT2NE201 67660//Mus musculus nuclear localization signal binding protein (spot-1) mRNA, complete cds.//3.50E-09// 
76aa//40%//S79410 

NT2NE20173970//Rattus norvegicus beta-catenin binding protein mRNA, complete cds.//1 .60E-24//134aa//46%// 
AF1 69825 

40 NT2NE201 7721 0//Leishmania major partial ppgl gene for proteophosphoglycan.//1 .1 0E-06//1 69aa//27%//AJ243460 

NT2NE20181760 

NT2NE20181800 

NT2NE201 84720 

NT2R 12001 6240 
45 NT2RI20021200 

NT2RI20033920 

NT2RI2009301 0//BI FUNCTIONAL M ETHYL ENETETRAHYDRO FOLATE DEHYDROGENASE/CYCLOHYDROLA- 
SE, MITOCHONDRIAL PRECURSOR [INCLUDES: NAD-DEPENDENT METHYLENETETRAHYDROFOLATE DEHY- 
DROGENASE (EC 1.5.1.15); METHENYLTETRAHYDROFOLATE CYCLOHYDROLASE (EC 3.5.4.9)] .//5.90E-34// 
50 86aa//81%//P13995 

NT2RP70001120//GA BINDING PROTEIN BETA-1 CHAIN (GABP-BETA-1 SUBUNIT) (GABPB1).//2.50E-1 13// 
384aa//62%//Q00420 

NT2RP70001730//Mus musculus actin-binding protein (ENC-1) mRNA, complete cds.//4.30E-249//589aa//74%// 
U65079 

55 NT2RP70003110//ELASTIN PRECURSOR (TROPOELASTIN).//1 .40E-1 65//613aa//61%//P 15502 
NT2RP70012830//CALPHOTIN.//7.90E-17//445aa//28%//Q02910 
NT2RP70022820 

NT2RP70027790//RAS SUPPRESSOR PROTEIN 1 (RSU-1) (RSP-1 PROTEIN) (RSP-1).//3.80E-23//186aa//34%// 
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Q01730 

NT2RP70029780//Z1NC FINGER PROTEIN ZFP-36 (FRAGMENT)V/4.30E-118//381 aa//56%//P16415 
NT2RP70030840//Mus musculus schlafen3 (S1fn3) mRNA, complete cds.//3.70E-55//328aa//33%//AF099974 
NT2RP70031 070//36 KDA NUCLEOLAR PROTEIN HNP36 (DELAYED-EARLY RESPONSE PROTEIN 12) (DER12).// 
5 1 .20E-23//1 69aa//34%//Q61 672 
NT2RP70031340 
NT2RP70031480 

NT2RP70035110//Caenorhabditis elegans UNC-89 (unc-89) gene, complete cds.//1 .50E-07//229aa//26%//U33058 
NT2RP70046410//BASONUCLIN.//3.60E-71//318aa//43%//Q01954 
10 NT2RP70049610 
NT2RP70056290 

NT2RP70056690//F-SPONDIN PRECURSOR V/2.20E-15//366aa//24%//P35447 
NT2RP70057500//Hypothetlcal zinc finger-like protein [Homo sapiens].//0//799aa//94%//AAF88107 
NT2RP70064570//CALPAIN P94, LARGE [CATALYTIC] SUBUNIT (EC 3.4.22.17) (CALCIUM- ACTIVATED NEUTRAL 
15 PROTEINASE) (CANP) (P94 PROTEIN) (MUSCLE-SPECIFIC CALCIUM -ACTIVATED NEUTRAL PROTEASE 3 
LARGE SUBUNIT).//9.40E-86//278aa//40%//Q64691 
NT2RP70074800 

NT2RP70075300//ZINC FINGER PROTEIN 211 (ZINC FINGER PROTEIN C2H2-25).//9.60E-121//333aa//63%// 
Q 13398 

20 NT2RP70075800//HYPOTHETICAL 43.1 KDA TRP-ASP REPEATS CONTAINING PROTEIN K04G11.4 IN CHRO- 
MOSOME X.//1 .80E-13//244aa//25%//Q93847 
NT2RP70080150 
NT2RP70084540 

NT2RP70087140//SKIN SECRETORY PROTEIN XP2 PRECURSOR (APEG PROTEIN).//1 .40E-11//264aa//31%// 
25 P 17437 

NT2RP70090870//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//4.20E-230//592aa//61%// 
Q05481 

NTONG20002230//Mus musculus RW1 protein mRNA, complete cds.//5.20E-97//546aa//34%//AF060565 
NTONG20005310 
30 NTONG20017620 

NTONG20029850//CALCYPHOSINE (R2D5 ANTIGEN).//1 .60E-24//183aa//32%//P41150 

NTONG20031580//heat shock 27kD protein family, member 7 (cardiovascular); cardiovascular heat shock protein [Ho- 
mo sapiens]//6.10E-69//141aa//95%//NP_055239 

NTONG20032100//KERATIN, TYPE I CYTOSKELETAL 13 (CYTOKERATIN 13) (K13) (CK 13).//4.20E-175//351aa// 
35 96%//P13646 

NTONG20034540//CGMP-DEPENDENT 3\5'-CYCLIC PHOSPHODIESTERASE (EC 3. 1. 4. 17) (CYCLIC GMP 
STIMULATED PHOSPHODIESTERASE) (CGS-PDE).//0//713aa//99%//000408 
NTONG20035150//RING CANAL PROTEIN (KELCH PROTEIN).//9.10E-30//570aa//25%//Q04652 
NTONG20043080//MYOSIN LIGHT CHAIN KINASE, SMOOTH MUSCLE AND NON-MUSCLE ISOZYMES (EC 
40 2.7.1 .117) (MLCK) [CONTAINS: TELOKIN].//4.30E-12//226aa//28%//P11799 

NTONG20048440//P116 RHO-INTERACTING PROTEIN (P116RIP) (RIP3).//1 .60E-269//588aa//87%//P97434 
NTONG20049180 

NTONG20053630//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//1 .30E-1 2//247aa//29%//P08640 
45 NTONG20053730//UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 64E (EC 3.1 .2.15) (UBIQUITIN THIOLESTE- 
RASE 64E) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 64E) (DEUBIQUITINATING ENZYME 64E).//8.70E- 
1 04//340aa//60%//Q24574 

NTONG20053910//Bos taurus differentiation enhancing factor 1 (DEF-1) mRNA, complete cds.//2.50E-198//891aa// 
4596//AF1 12886 

50 NTONG20055200//ELONGATION FACTOR G, MITOCHONDRIAL PRECURSOR (MEF-G).//1 .1 0E-269//522aa// 
83%//Q07803 

NTONG20058010//ACETYL-COENZYMEASYNTHETASE(EC6.2.1.1)(ACETATE--COA LIGASE) (ACYL- ACTIVAT- 
ING ENZYME).//3.90E-124//398aa//54%//068040 

NTONG20058220//Homo sapiens phosphoprotein pp75 mRNA, partial cds.//8.60E-14//349aa//25%//AF 153085 
55 OCBBF20000740//Homo sapiens mRNA for ISLR, complete cdsV/3.60E-87//287aa//59%//AB0031 84 
OCBBF20001780 

OCBBF20005220//Rattus norvegicus Fos-related antigen mRNA, complete cds.//3.40E-233//552aa//84%//U34932 
OCBBF20009820 
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OCBBF20011860//Mus musculus epithelial protein lost in neop!asm-a (Eplin) mRNA, complete cds7/4.30E-33//98aa// 
66%//AF307844 

OCBBF2001 2520//Homo sapiens mRNA for ISLR, complete cds7/6.50E-88//287aa//59%//AB0031 84 
OCBBF20016390 

5 OCBBF2001 681 0//enhancer of polycomb [Mus muscu!us]//0//743aa//92%//AF079765 
OCBBF201 09450 
OCBBF201 09780 

OCBBF20110210//KILON PROTEIN PRECURSOR (KINDRED OF IGLON).//4.30E-182//352aa//94%//Q9ZOJ8 
OCBBF201 10730 

10 OCBBF20111370//BCL2/ADENOVIRUS E1B 19-KDA PROTEIN-INTERACTING PROTEIN 27/5.70E-84//324aa// 
52%//054940 

OCBBF20111600//69 KDA ISLET CELL AUTOANTIGEN (ICA69) (ISLET CELL AUTOANTIGEN 1).//1 .30E-1 13// 
335aa//57%//Q05084 

OCBBF20112280//Mesembryanthemum crystallinum phosphoenolpyruvate/phosphate translocator precursor (PPT) 
15 mRNA, complete cds7/7.00E-09//113aa//30%//AF223359 
OCBBF201 12320 
OCBBF20113110 
OCBBF20115360 

OCBBF20116250//Mus musculus C2H2-type zinc finger protein (Evi9) mRNA, complete cds.//0//592aa//99%// 
20 AF051525 

OCBBF201 17220 
OCBBF201 18720 
OCBBF201 19810 

OCBBF20120010//Homo sapiens zinc metalloprotease ADAMTS6 (ADAMTS6) mRNA, complete cds.//1 .60E-44// 

25 21 7aa//42%//AF1 40674 

OCBBF20120950//ZINC FINGER PROTEIN 151 (POLYOMAVIRUS LATE INITIATOR PROMOTER BINDING PRO- 
TEIN) (LP-1) (ZINC FINGER PROTEIN Z13).//1 .00E-28//201 aa//34%//Q60821 
OCBBF20121910//LAF-4 PROTEIN (LYMPHOID NUCLEAR PROTEIN).//0//740aa//98%//P51826 
OCBBF201 23200 

30 OCBBF201 42290 
OCBBF201 47070 
OCBBF201 52330 
OCBBF201 55030 

OCBBF20156450//ZINC FINGER PROTEIN 75.//8.20E-163//289aa//99%//P51815 
35 OCBBF20157970//ZINC FINGER PROTEIN 135.//2.80E-98//306aa//56%//P52742 

OCBBF20160380//llver stage antigen LSA-1 - Plasmodium falciparum//1 .10E-21//938aa//24%//A45592 . 

OCBBF20165900//CELL SURFACE ANTIGEN 114/A10 PRECURSOR7/4.80E-09//145aa//31%//P19467 

OCBBF20165910//Mus musculus pecanex 1 mRNA, complete cds7/4.20E-11 6//407aa//53%//AF096286 

OCBBF20166890//RESTIN (CYTOPLASMIC LINKER PROTEIN-170) (CLIP-170).//4.20E-21//124aa//45%//042184 
40 OCBBF20166900//ZINC FINGER PROTEIN CLONE 647 ( FRAGME NT) . //9. 00E-1 8//196aa//30%//P 15622 

OCBBF20167290//probable acyl-CoA dehydrogenase - Deinococcus radiodurans (strain R1)//2.50E-72//222aa//60%// 

D75616 

OCBBF20170350//Mus musculus mRNA for GATS protein7/2.50E-56//121aa//96%//AJ296173 

OCBBF20174580//G2/MITOTIC-SPECIFICCYCLINS13-7(B-LIKECYCLlN)(FRAGMENT).//7.10E-16//240aa//25%// 
45 P25012 

OCBBF20174890//ankyrin 3, long splice form - human//1 .1 0E-150//1 78aa//100%//A55575 

OCBBF20175360//Homo sapiens C2H2 (Kruppel-type) zinc finger protein mRNA, complete cds.//3.80E-11//101 aa// 
36%//AF 159567 
OCBBF201 76650 

50 OCBBF20177540//ZINC FINGER PROTEIN MLZ-4 (ZINC FINGER PROTEIN 46)7/1 .40E-11 0//223aa//86%//Q 03309 
OCBBF201 77910 

OCBBF20182060//Mesocricetus auratus guanine nucleotide-binding protein beta 5 (Gnb5) mRNA, complete cds7/ 
7.20E-82//265aa//61 %//U1 31 52 
OCBBF201 85630 
55 OCBBF201 88280 

OCBBF20191950//VERY LOW-DENSITY LIPOPROTEIN RECEPTOR PRECURSOR (VLDL RECEPTOR) 7/0// 
720aa//97%//P98155 

PANCR1 0000860//ELASTASE IIIB PRECURSOR (EC 3.4.21 .70) (PROTEASE E)7/1 .10E-52//87aa//97%//P08861 
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PEBLM1 0001470//g1utamine (Q)-rich factor 1 , QRF-1 [mice, B-cell leukemia, BCL1 , Peptide Partial, 84 aa]//5.00E-42// 
84aa//98%//AAB29272 

PEBLM20001800//IG ALPHA- 1 CHAIN C REGION7/7.90E-197//353aa//100%//P01876 

PEBLM20003260//ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HP F1 ) 7/4.90 E-62//1 51 aa//70%//P51 522 
5 PEBLM20005020 

PLAC E5000 1 290//H Y POTH ETI C AL 87.9 KDA PROTEIN F44G4.1 IN CHROMOSOME II PRECURSOR7/2.90E-16// 

102aa//43%//P54073 

PLAC E5000 1390 

PLACE60001910 

10 PLAC E60004260//CYSTATI N M PRECURSOR (CYSTATIN E)7/1 .50E-37//81 aa//97%//Q 15828 
PLACE60006300 
PLACE60011180 

PLACE60012620//LYSOSOMAL TRAFFICKING REGULATOR (BEIGE HOMOLOG).//8.40E-14//128aa//38%// 
Q99698 
15 PLACE60017120 

PLACE60052940//TRANSCRIPT REGULATORY PROTEIN GAL11 7/1 .70E-05//445aa//23%//P32257 
PLACE60053280 

PLACE60054230//DIAPHANOUS PROTEIN HOMOLOG 27/1 .00E-35//385aa//26%//060879 
PLACE60054820//HYPOTHETICAL PROTEIN KIAA0032.//1 .20E-50//180aa//61%//Q 15034 
20 PLACE60054870//MYOSIN HEAVY CHAIN, NONMUSTYPEB (CELLULAR MYOSIN HEAVYCHAIN, TYPE B)(NMM- 
HC-B)7/3.70E-11//434aa//20%//P35580 
PLACE60055350 

PLACE60055460//Homo sapiens leucine-zipper protein FKSG13 (FKSG13) mRNA, complete cds7/5.40E-164// 
327aa//99%//AF31 2393 

25 PLACE60055590//MYOS1N-BINDING PROTEIN H (MYBP-H) (H-PROTEIN) (86 KDA PROTEIN)7/1 .30E-05//124aa// 
35%//Q05623 
PLACE60056910 
PLACE60057860 
PLACE60061370 

30 PLACE60062660//ADP-ribosylation factor binding protein GGA1 ; ADP-ribosylation factor binding protein; Golgi-asso- 
ciated, gamma-adaptin ear containing, ARF-binding protein 1 [Homo sapiens] 7/6. 00 E-84//249aa//93%//NP_03 7497 
PLACE60062870 
PLACE60063940 

PLACE600641 80//PUTATIVE PRE-MRNA SPLICING FACTOR ATP-DEPENDENT RNA 
35 HELICASE.//1 .90E-51//368aa//37%//022899 

PLACE60064740//ADHESIVE PLAQUE MATRIX PROTEIN PRECURSOR (FOOT PROTEIN 1 ) (MCFP1).//4.80E-11// 
1 57aa//3 1 %//Q25434 

PLACE60066970//ZINC FINGER PROTEIN 1 91 7/1 .30E-36//115aa//48%//0 14754 
PLACE6006871 0//SUPPRESSOR PROTEIN SRP407/9.50E-43//238aa//50%//P32583 
40 PLACE60069880 
PLACE60070500 

PLACE60071 800//CORONIN-LIKE PROTEIN P577/3.80E-60//1 08aa//81 %//Q921 76 

PLACE60072390 

PLACE60072420 

45 PLACE60073090//Homo sapiens myo-inositol 1 -phosphate synthase A1 (ISYNA1 ) mRNA, complete cds.//6.60E-21 9// 
362aa//98%//AF220530 
PLACE60074820 
PLACE60077870 

PLACE60080360//mucin [Homo sapiens]//5.50E-05//164aa//30%//CAA84032 
50 PLACE60081260 
PLACE60082850 

PLACE60087680//INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN 3 PRECURSOR (IGFBP-3) (IBP-3) 
(IGF-BINDING PROTEIN 3)7/2.30E-103//255aa//79%//P 17936 
PLACE60088240 
55 PLACE60092280 
PLACE60092370 
PLACE60093380 
PLACE60095240 
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PLACE60095600//HYPOTHETICAL HELICASE C28H8.3 IN CHROMOSOME HI.//2.70E-28//201 aa//38%//Q 09475 
PLACE60098350//Human hepatocellular carcinoma associated protein (JCL-1) mRNA, complete cds.//5.20E-285// 
558aa//97%//U92544 

PLACE60104630//macrophage migration inhibitory factor (glycosylation-inhibiting factor) [Homo sapiens]//9.70E-51// 
5 1 1 0aa//93%//XP_000858 

PLACE601 05680//Homo sapiens mRNA for TU12B1 -TY, complete cds.//1 .70E-30//65aa//64%//AB032773 
PLACE601 0701 0//SUPPRESSOR PROTEIN SRP407/3.80E-05//117aa//29%//P32583 
PLACE601 09910 

PLACE60113340%/BASEMENT MEMBRANE-SPECIFIC HEPARAN SULFATE PROTEOGLYCAN CORE PROTEIN 
10 PRECURSOR (HSPG) (PERLECAN) (PLC).//5.90E-65//238aa//32%//Q05793 

PLACE60118810//Rattus norvegicus kinesin light chain KLCt mRNA, complete cds.//1 ,90E-230//504aa//87%// 
AF1 66267 

PLACE60119700//Homo sapiens mRNA for ABP32, complete cds.//2.30E-21//47aa//100%//AB0 18357 
PLACE60120280//SERvTHR-RICH PROTEIN T10 IN DGCR REGION.//6.00E-99//126aa//84%//P54797 
15 PLACE601 22970//novel C2H2 type zinc finger protein//5.60E-84//1 69aa//98%//CAC1 0457 
PLACE60132200/yTRICH0HYALINV/3.10E-47//297aa//47%//P37709 
PLACE601 32320 
PLACE601 32880 

PLACE601 38840//PUTATIVE MITOCHONDRIAL CARRIER PROTEIN PET8.//7.40E-59//274aa//47%//P38921 
20 PLACE60140640//Homo sapiens nucleotide binding protein (NBP) mRNA, complete cds.//3.90E-138//262aa//99%// 
AF208536 

PLACE601 5051 0//NU CLEAR PROTEIN SNF7.//1 .60E-11//1 89aa//25%//P39929 

PLACE601 54450//PUTATIVE PREOPTIC REGULATORY FACTOR-2 PRECURSOR (PORF-2).//7.30E-36//75aa// 
98%//P18890 
25 PLACE601 55910 
PLACE601 57310 
PLACE60162100 
PLACE601 75640 

PLACE60177880//IGSF5 [Homo sapiens]y/3.60E-174//315aa//99%//CAB90447 
30 PLACE601 77910 
PLACE60181870 

PLACE601 8441 0//Mus musculus peroxisomal long chain acyl-CoA thioesterase lb (Pte1 b) gene, exon 3 and complete 
cds.//1.00E-43//126aa//69%//AF1 80801 

PLACE601 84870//PHOSPHOLIPID TRANSFER PROTEIN PRECURSOR (LIPID TRANSFER PROTEIN ll).//1.50E- 
35 227//330aa//99%//P55058 

PLACE60188630//Mus musculus mRNA for acetylglucosaminyltransferase-like protein.//7.80E-08//317aa//23%// 
AJ006278 

PROST10001100//Zea mays clone AGPZml arabinogalactan protein (agp) mRNA, partial cds.//5.80E-05//180aa// 
32%//AF 134579 
40 PROST1 0001 360 
PROST1 00021 50 

PROST20007170//Hypothetical Kruppel-Type Zinc Finger Protein (R28830_1 )//0//432aa//1 00%//AAC24608 
PROST20007600//PUTATIVE ENDONUCLEASE C1F12.06C (EC 3.1 .-. -).//5. 90 E-29//134aa//44%//Q 10348 
PROST20011160 
45 PROST20011800 
PROST20014140 
PROST20014150 
PROST200 14650 

PROST20015210//MYOSIN HEAVY CHAIN, SMOOTH MUSCLE ISOFORM (SMMHC) (FRAGMENT).//3.00E-107// 
50 262aa//85%//P35749 
PROST200 15400 

PROST20016760//M-PHASE PHOSPHOPROTEIN 8 (FRAGMENT).//9.00E-157//298aa//99%//Q99549 
PROST20022120 

PROST20024250//ZINC FINGER PROTEIN 1367/1. 70 E-45//128aa//63%//P52 737 
55 PROST20028970//Oryctolagus cuniculus CARP mRNA, complete cds.//4.80E-44//177aa//51%//AF131883 

PROST20033240//EPHRIN TYPE-A RECEPTOR 6 PRECURSOR (EC 2.7.1.112) (TYROSINE-PROTEIN KINASE 
RECEPTOR EHK-2) (EPH HOMOLOGY KINASE-2).//8.40E-241//441aa//95%//Q62413 

PROST20035170//Homo sapiens zinc finger protein dp mRNA, complete cds .//3. 40 E-15//128aa//42%//AF 153201 
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PROST20035830 
PROST20036280 

PROST20036350//MULTI FUNCTIONAL AMINOACYL-TRNA SYNTHETASE [INCLUDES: GLUTAMYL-TRNA SYN- 
THETASE (EC 6.1.1.17) (GLUTAMATE--TRNA LIGASE); PROLYL-TRNA SYNTHETASE (EC 6.1.1.15) (PROLINE- 
5 -TRNA LIGASE)].//2.20E-137//651aa//42%//P07814 

PROST20039300//Bos taurus mitochondrial mRNA for xenobiotic/medium-chain fatty acid:CoA ligase form XL-Ill.// 

1 .60E-68//180aa//68%//AJ1 32751 

PROST20041460 

PROST20042700 

10 PROST20045700//Zea mays clone AGPZml arabinogalactan protein (agp) mRNA, partial cds.//5.80E-05//1 80aa// 
32%//AF 134579 
PROST20047440 
PROST20048170 

PROST20050390//CYTOCHROME P450 2J2 (EC 1.14.14.1) (CYPIIJ2) (ARACHIDONIC ACID EPOXYGENASE).// 
15 1 .40E-34//1 88aa//42%//P51 589 

PROST20051310//Homo sapiens DEAD-box protein abstrakt (ABS) mRNA, complete cds 7/8.50 E-134//257aa//99%// 

AF195417 

PROST20052720 

PROST20052850//CYCLIN G-ASSOCIATED KINASE (EC 2.7.1 .-) 7/2.20 E-18//107aa//54%//P97874 
20 PROST20054660 

PROST20058860//SMALL NUCLEAR RIBONUCLEOPROTEIN ASSOCIATED PROTEINS BAND B' (SNRNP-B) (SM 

PROTEIN B/B') (SM-B/SM-B , ).//8.20E-05//134aa//33%//P14678 

PROST20060200 

PROST20062820/7TRANSLATION INITIATION FACTOR IF-27/1 .50E-07//1 02aa//41 %//P71613 
25 PROST20063430//BCL2/ADENOVIRUS E1B 19-KDA PROTEIN-INTERACTING PROTEIN 27/8.50E-74//305aa// 
46%//Q 12982 
PROST20065100 

PROST20065790//6-PHOSPHOFRUCTOKINASE, TYPE (EC 2.7.1. 11) (PHOSPHOFRUCTOKINASE 1)(PHOSPHO- 
HEXOKINASE) (PHOSPHOFRUCTO-1 -KINASE ISOZYME C) (6-PHOSPHOFRUCTOKINASE, PLATELET TYPE)7/ 
30 0//697aa//99%//Q01 81 3 
PROST20073280 

PROST20075280//MYOSIN LIGHT CHAIN KINASE, SMOOTH MUSCLE AND NON-MUSCLE ISOZYMES (EC 
2.7.1 .117) (MLCK) [CONTAINS: TELOKIN]7/1 .30E-08//245aa//23%//P1 1 799 
PROST20078710 
35 PROST20082430 

PROST20084470//Plasmodium berghei strain NYU2 merozoite surface protein-1 mRNA, partial cds.//2.50E-08// 

1 22aa//28%//AF00041 3 

PROST20084680 

PROST20084720//CYTOCHROME P450 4F2 (EC 1 .14.13.30) (CYPIVF2) (LEUKOTRIENE-B4 OMEGA- HYDROXY- 
40 LASE) (LEUKOTRIENE-B4 20-MONOOXYGENASE) (CYTOCHROME P450- LTB-O MEGA) 7/1 .50E-37//85aa//85%// 
P78329 

PROST20087240 
PROST20093470 
PROST20094000 
45 PROST20097310 
PROST20097360 

PROST20097840//SYNAPSIN 17/1 80E-09//193aa//34%//P17599 

PROST20099090//ADAM 12 PRECURSOR (EC 3.4.24.-) (A DISINTEGRIN AND METALLOPROTEINASE DOMAIN 
12) (MELTRIN ALPHA).//1 .90E-22//73aa//60%//043184 PROST20102190//CALMODULIN7/1 .30E-20//98aa//47%// 
50 P02594 

PROST201 02500 
PROST201 03820 

PROST20105450//SODIUM/HYDROGEN EXCHANGER 6 (NA(+)/H(+) EXCHANGER 6) (NHE-6) (KIAA0267).// 
1 .60E-96//21 4aa//75%//Q92581 
55 PROST201 06060 

PROST20108850//MICROSOMAL SIGNAL PEPTIDASE 23 KDA SUBUNIT (EC 3.4.-.-) (SPC22/23).//1 .20E-69// 

1 32aa//1 00%//PI2280 

PROST20110120 
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PROST20114100 

PROST20120070//KINESIN HEAVY CHAIN (UBIQUITOUS KINESIN HEAVY CHAIN) (UKHC).//5.00E-05//286aa// 

24%//P33176 

PROST20121570 

5 PROST20122490//Gallus gallus syndesmos mRNA, complete cds.//1 .20E-63//139aa//84%//AF095446 
PROST201 24000 
PROST20 125420 

PROST20127450//Homo sapiens TSC-22 related protein (TSC-22R) mRNA, complete cds.//7.90E-44//95aa//98%// 
AF1 53603 
10 PROST201 30320 
PROST201 38730 

PROST20146590//MUCIN 1 PRECURSOR (POLYMORPHIC EPITHELIAL MUCIN) (PEM) (PEMT) (EPISIALIN) (TU- 
MOR-ASSOCIATED MUCIN) (CARCINOMA-ASSOCIATED MUCIN) (TUMOR-ASSOCIATED EPITHELIAL MEM- 
BRANE ANTIGEN) (EMA) (H23AG) (PEANUT- REACTIVE URINARY MUCIN) (PUM) (BREAST CARCINOMA-ASSO- 

15 CIATED ANTIGEN DF3).//3.50E-08//556aa//23%//P1 5941 

PROST20151370//Human probable zinc finger protein H101 mRNA, partial cds.//1 .60 E- 11 //104aa//41%//U8 1557 
P RO ST20 1 525 1 0//U D P-N -AC ETYLG LUC OS AMINE-PEPTIDE N-ACETYLGLUCOSAMINYLTRANSFERASE 100 
KDA SUBUNIT (EC 2.4.1.-) (0-GLCNAC TRANSFERASE P100 SUBUNlT).//6.00E-17//148aa//34%//0 15294 
PROST20152870//Homo sapiens APC2 gene, exon 147/1 .60E-05//195aa//309b//AJ 131 187 

20 PROST20155370//ZINC FINGER PROTEIN 38 (ZFP-38) (CTFIN51) (TRANSCRIPTION FACTOR RU49).//1 .10E-72// 
1 40aa//55%//Q07231 
PROST201 56360 
PROST20 159320 

PROST20168600//Homo sapiens six transmembrane epithelial antigen of prostate (STEAP1) mRNA, complete cds.// 
25 1 .20E-70//237aa//54%//AF1 86249 

PUAEN10000650//Homo sapiens TSC-22 related protein (TSC-22 R) mRNA, complete cds.//7.90E-44//95aa//98%// 
AF1 53603 
PUAEN1 0000870 

PUAEN10001640//Mus musculus cerebellar postnatal development protein-1 (Cpd1) mRNA, partial cds.//2.80E-126// 
30 270aa//90%//U89345 
PUAEN20000800 

PUAEN20001520//L-A VIRUS GAG PROTEIN N-ACETYLTRANSFERASE (EC 2.3.1.-).//1 .70E-34//145aa//51 96// 
Q03503 

PUAEN20002470//PR MOV-1 0.//6.30E-1 02//405aa//44%//P23249 
35 PUAEN200031 20//ENHANCER OF ZESTE HOMOLOG 2 (ENX-1).//0//643aa//97%//Q 15910 SALGL1 0001070// 
CH-TOG PROTEIN (COLONIC AND HEPATIC TUMOR OVER-EXPRESSED PROTEIN) (KIAA0097).//5.30E-150// 
297aa//96%//Q1 4008 
SKMUS20006790 

SKMUS20007260//MICROTUBULE-ASSOCIATED PROTEIN 1B (MAPI. 2) (MAP1(X)) [CONTAINS: NIAP1 LIGHT 
40 CHAIN LC1]7/8.10E-05//396aa//23%//P14873 

SKMUS20008730//smoothelin large isoform L2 [Homo sapiens] .//1 .20E-221//217aa//98%//AAF0 1481 
SKMUS20017400//TROPOMYOSIN ALPHA CHAIN, SKELETAL MUSCLE TYPE. //3.20E-97//242aa//81%//P06753 
SKMUS20020770 
SKMUS20026340 

45 SKMUS20040440//60S RIBOSOMAL PROTEIN L3 (HIV-1 TAR RNA BINDING PROTEIN B) (TARBP-B)7/3.60E-189// 
229aa//99%//P39023 
SKMUS20064810 

SKMUS20073150//20-HYDROXYECDYSONE PROTEIN PRECURSOR (20-HE).//1 .50E-05//129aa//32%//P29681 
SKMUS20073590//tropomodulin 4 (muscle) [Homo sapiens]//3.90E-58//115aa//100%//NP_037485 
50 SKMUS200791 50//splicing factor 3a, subunit 3, 60kD; pre-mRNA splicing factor SF3a (60kD), similar to S. cerevisiae 
PRP9 (spliceosome-associated protein 61) [Homo sapiens]//9.10E-13//65aa//60%//NP_006793 
SKMUS20091900 

SKNMC1 0001 230//CYCLIN-DEPENDENT KINASE 4 INHIBITOR A (CDK4I) (P16-INK4) (P16-INK4A) (TUMOR SUP- 
PRESSOR CDKN2A).//1 .60E-08//105aa//38%//077617 
55 SKNMC20006350//65 KDA FK506-BINDING PROTEIN PRECURSOR (EC 5.2.1.8) (FKBP65) (FKBPRP) (PEPTI- 
DYL-PROLYL CIS-TRANS ISOMERASE) (PPIASE) (ROTAMASE) (IMMUNOPHILIN FKBP65).//1 10E-185//419aa// 
79%//Q61576 
SKNSH10001010 
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SKNSH20007160 

SKNSH20009710//TROPOMYOSIN, CYTOSKELETAL TYPE (TM30-NM)./13.20E-74//1 74aa//86%//P 12324 
SKNSH20030640//SPLICEOSOME ASSOCIATED PROTEIN 62 (SAP 62) (SF3A66).//3.00E-06//127aa//33%// 
Q62203 
5 SKNSH20040390 

SKNSH20052400/A/EGETATIBLE INCOMPATIBILITY PROTEIN HET-E-1 .//3.60E-1 7//1 75aa//29%//Q00808 
SKNSH20057920//CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE TYPE I (EC 2.7.1.123) (CAM KINASE 
I ) 7/5 . 00 E -37// 1 97aa//40%//Q 14012 
SKNSH20068220 
10 SKNSH20094350 

SMINT20000070//Mus musculus mRNA for granuphilin-b, complete cds.//2.70E-44//128aa//41%//AB025259 
SMINT20002320//SERINE/THREONJNE PROTEIN PHOSPHATASE 2A, 56 KDA REGULATORY SUBUNIT, BETA 
ISOFORM (PP2A, B SUBUNIT, B' BETA ISOFORM) (PP2A, B SUBUNIT, B56 BETA ISOFORM) (PP2A, B SUBUNIT, 
PR61 BETA ISOFORM) (PP2A, B SUBUNIT, R5 BETA lSOFORM)7/4.70E-160//299aa//100%//Q15173 
15 SMINT20006020//faciogenital dysplasia protein 2 [Mus musculus]//4.10E-158//327aa//87%//AF01 7368 

SMINT20006090//Oryctolagus cuniculus mRNA for parchorin, complete cds.//7.50E-87//175aa//93%//AB035520 
SMINT20007470//TRICHOHYALIN.//1.20E-37//492aa//28%//Q07283 

SMINT200081 1 0//CALCIUM-T RAN SPORTING AT PAS E 2C1 (EC3.6.1 .38) (ATP-DEPENDENTCA2+ PUMP PMR1).// 
1 .20E-50//165aa//63%//P98194 
20 SMINT20011830 

SMINT20011950//ZINC FINGER PROTEIN 202.//1 .90E-67//426aa//40%//095125 

SMINT20012220 

SMINT20013970 

SMINT20014610 

25 SMINT20016150//FERR1TIN LIGHT CHAIN (FERRITIN L SLJBUNIT).//3.50E-91//1 74aa//1 00%//P02792 
SMINT20017310 
SMINT20021260 
SMINT20023110 

SMINT20024140//IG KAPPA CHAIN V-IV REGION B1 7 PRECURSOR 7/1 .20 E-60//134aa//87%//P 0631 4 
30 SMINT20026200//ENL PROTEIN.//1 .10E-05//260aa//24%//Q03111 

SMINT20028800//tumor supressor protein - fruit fly (Drosophila melanogaster).//2.00E-78//493aa//34%//T13797 
SMINT20028840//CMRF35 ANTIGEN PRECURSOR.//8.40E-1 9//136aa//41 %//Q08708 
SMINT20030740//ZINC FINGER PROTEIN 1 367/4. 50 E-1 94//535aa//63%//P52737 
SMINT20031280 

35 SMINT20035050//GTPASE ACTIVATING PROTEIN BEM2/IPL2.//1 .20E-07//134aa//26%//P39960 

SMINT20035510//Drosophila melanogaster La related protein (larp) mRNA, partial cds.//5.40E-39//334aa//30%// 
AF221108 

SMINT20036440//Drosophila melanogaster epsin-like protein mRNA, complete cds.//2.90E-69//446aa//41%// 
AF233291 

40 SM I NT20038660//Homo sapiens HNOEL-iso (HNOEL-iso) mRNA, complete cds7/5.60E-201//368aa//99%//AF201945 
SMINT20039050//Homo sapiens TRIAD3 mRNA, partial cds.//3.20E-86//1 56aa//1 00%//AF228527 
SMINT20043390 

SMINT20044140//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2)7/3.50E-39//440aa//28%//P51523 
SMINT20044730//UBIQUINONE BIOSYNTHESIS PROTEIN AARF7/6.20E-22//272aa//27%//P27854 
45 SMINT20045470 
SMINT20045830 
SMINT20045890 
SMINT20047290 
SMINT20048720 

50 SMINT20049920//CYTOSOLIC PURINE 5'-NUCLEOTIDASE (EC 3.1 .3.5)7/4.1 0E-39//132aa//57%//P49902 

SMINT20052130//Rattus norvegicus mRNA for gankyrin homologue, complete cds.//9.20E-07//1 04aa//33%// 
AB022014 

SMINT20054050//ABC1 PROTEIN HOMOLOG P R ECU RS OR 7/1 .40 E-98//467aa//45%//Q92338 
SMINT20056230//Ig mu chain precursor, membrane-bound (clone 201) - human//5. 60 E-233//422aa//78%//S 14683 
55 SMINT20056240 

SMINT20062050//PLECTIN7/7.50E-17//436aa//25%//P30427 

SMINT20067080 

SMINT20070620 
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SMINT20074330//tektin A1 [Strongylocentrotus purpuratus]//3.10E-26//125aa//45%//M97188 
SMINT20077920 

SMINT20077960//GELSOLIN PRECURSOR, PLASMA (ACTIN-DE POLYMERIZING FACTOR) (ADF) (BREVIN) 
(AGEL).//3.30E-246//459aa//99%//P06396 
5 SMINT20081330 

SM1NT20083290//IG ALPHA- 1 CHAIN C REGION.//4.40E-196//352aa//99%//P01876 

SMINT20084910 

SMINT20085310 

SMINT20085450 

10 SMINT20086250//GLYCINE CLEAVAGE SYSTEM H PROTEIN PRECURSOR.//2.20E-40//70aa//97%//P23434 
SMINT20086720//ZINC FINGER PROTEIN 1 91 .//1 .40E-29//109aa//55%//0 14754 

SMINT20088440//IG KAPPA CHAIN V-ll REGION RPMI 6410 PRECURSOR.//5.10E-44//117aa//78%//P06310 

SMINT20088690 

SMINT20089210 

15 SMINT20089600//Homo sapiens mRNA for PICK1 , complete cds./1 3.1 0E-1 45//278aa//1 00%//AB026491 

SMINT200911 90//IG ALPHA- 1 CHAIN C REGION.//3.70E-198//353aa//99%//P01876 

SMINT20092120 

SMINT20092160 

SMINT20093630 
20 SMINT20094150 

SMINT20094680//Homo sapiens mawbp mRNA for MAWD binding protein, complete cds.//2.80E-50//77aa//100%// 
AB049758 
SPLEN20005160 
SPLEN20005370 
25 SPLEN20006950 
SPLEN20011350 
SPLEN20012450 
SPLEN20015030 

SPLEN2001 5100//HYPOTHETICAL 72.5 KDA PROTEIN C2F7.10 IN CHROMOSOME l.//7.50E-16//121aa//36%// 
30 Q09701 

SPLEN20016500 

SPLEN20017610 

SPLEN20017810 

SPLEN20019120 
35 SPLEN20020530 

SPLEN20023430 

SPLEN20023540//H.sapiens mRNA for F25B3.3 kinase like protein from C. elegans.//1 .50E-205//385aa//99%// 
Y 12336 

SPLEN20023850//DNA REPAIR PROTEIN RAD187/3.00E-56//469aa//30%//P53692 
40 SPLEN20024190//EGF-containing fibulin-like extracellular matrix protein 1 ; fibrillin-like [Homo sapiens]//3.70E-1 92// 
327aa//99%//NP_061 489 
SPLEN20024510 

SPLEN20024620//Homo sapiens mRNA for acetyl LDL receptor, complete cds.//1 .00E-21 7//401 aa//1 00%//D86864 
SPLEN20024770//Rattus norvegicus (rsec6) mRNA, complete cds.//2.10E-88//545aa//31%//U32575 
45 SPLEN20024930//Rattus norvegicus PIPP mR.NA for proline-rich inositol polyphosphate 5-phosphatase, complete 
cds.//0//639aa//91 %//AB032551 
SPLEN20029170 
SPLEN20036780 

SPLEN200391 80//TENSIN.//2.70E-1 35//341 aa//653o//Q04205 
so SPLEN20040780//C B (SMALL PROLINE-RICH PROTEIN 1 B) (SPR1 B) (SPR1 
B).//5. 1 0E-1 2//1 1 0aa//37%//Q62267 

SPLEN20041810//BC-2 protein [Homo sapiens]//4.00E-24//59aa//96%//AF042384 
SPLEN20042200//TRANSCRIPTIONAL REPRESSOR CTCF.//8. 40 E-22//127aa//33%//P49711 
SPLEN20043430 
55 SPLEN20043460 

S PLEN20043680//DN A-REPAI R PROTEIN COMPLEMENTING XP-D CELLS (XERODERMA PIGMENTOSUM 
GROUP D COMPLEMENTING PROTEIN) (DNA EXCISION REPAIR PROTEIN ERCC-2)7/7.30E-1 71//325aa//99%// 
P 18074 
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SPLEN20045550 

SPLEN20048800//Homo sapiens mRNA for N-Acetylglucosamine kinase 7/1 .90E-51//1 04aa//100%//AJ242910 
SPLEN20049840//M.musculus mRNA for myosin l.//0//1093aa//89%//X97650 
SPLEN20050090//TRICHOHYALIN7/7.20E-17//554aa//23%//P37709 

5 SPLEN20051420 

SPLEN20054160//Dof protein [Drosophila melanogastery/9.60E-14//222aa//29%//AJ010641 
SPLEN20054500//Homo sapiens mRNA for putative dipeptidase.//7.10E-130//244aa//100%//AJ295149 
SPLEN20055600//ZINC FINGER PROTEIN 46 (ZINC FINGER PROTEIN KUP).//3.00E-56//155aa//63%//P24278 
SPLEN20057830//REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 (RMSA-1).//1 .10E-16//139aa//43%//P49646 

10 SPLEN20057900//Homo sapiens N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase mRNA : 
complete cds.//9.00E-75//130aa//93%//AF1 87072 
SPLEN20058180 

SPLEN20059270//Z1NC-FINGER PROTEIN RFP (RET FINGER PROTEIN).//7.20E-61//497aa//31%//P1 4373 
SPLEN20062830 

15 SPLEN20063250//zinc finger protein nocA - fruit fly (Drosophila melanogaster)//9.00E-1 6//364aa//30%//A55929 

SPLEN20063890//FIBROMODULIN PRECURSOR (FM) (COLLAGEN-BINDING 59 KDA PROTEIN).//2.60E-118// 

242aa//94%//Q06828 

SPLEN20067010 

SPLEN20071820//Homo sapiens DNA polymerase mu (Pol mu) mRNA, complete cds.//4.70E-62//11 6aa//1 00%// 
20 AF1 76097 

SPLEN20073500//TUMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 2 (B94 PR0TEIN).//3.50E-10//463aa// 

25%//Q03169 

SPLEN20073880 

SPLEN20076190 

25 SPLEN20076470//KINES1N LIGHT CHAIN (KLC) 7/2.40 E-1 8//1 37aa//38%//P46825 

SPLEN20080070//TISSUE ALPHA-L-FUCOSIDASE PRECURSOR (EC 3.2.1.51) (ALPHA-L-FUCOSIDA I) (AL- 

PHA-L-FUCOSIDE FUCOHYDROLASE) 7/4.70E-253//359aa//98%//P04066 

SPLEN20081640 

SPLEN20085910//Homo sapiens protein activator of the interferon-induced protein kinase (PACT) mRNA, complete 
30 cds.//2.90E-23//52aa//1 00%//AF072860 
SPLEN20087370 
SPLEN20087860 

SPLEN20090880//HLA CLASS I HISTOCOMPATIBILITY ANTIGEN, A-24(A-9) ALPHA CHAIN PRECURSOR (AW- 
24)7/1 .40E-66//1 53aa//83%//P05534 
35 SPLEN20098030//TRANSCRIPTION INTERMEDIARY FACTOR 1-BETA (NUCLEAR COREPRESSOR KAP-1) 
(KRAB-ASSOCIATED PROTEIN 1)7/4.00E-18//267aa//26%//Q13263 

SPLEN20100040//258.1 1 KDA PROTEIN C210RF5 (KIAA0933).//7.90E-46//223aa//43%//Q9Y3R5 
SPLEN201 01 950//SODIUM/HYDROGEN EXCHANGER 6 (NA(+)/H(+) EXCHANGER 6) (NHE-6) (KIAA0267)7/7.70E- 
1 1 2//353aa//6 1 %//Q9258 1 
40 SPLEN20104150 
SPLEN201 04690 
SPLEN20105100 

SPLEN20108000//Homo sapiens steroid dehydrogenase homolog mRNA, complete cds.//9.30E-73//155aa//94%// 
AF078850 
45 SPLEN201 08460 

SPLEN20110180//Homo sapiens golgi membrane protein GP73 mRNA, complete cds.//1 .60E-1 62//347aa//91%// 
AF236056 
SPLEN201 10210 
SPLEN201 10860 
50 SPLEN20111450 
SPLEN20114190 

SPLEN20116720//Homo sapiens misato mRNA, partial cds.//1 .00E-277//481aa//89%//AF272833 
SPLEN201 17580 

SPLEN2011 8050//BONE/CARTILAGE PROTEOGLYCAN I PRECURSOR (BIGLYCAN) (PG-S1)7/1 .00E-123//240aa// 
55 9796//P21810 

SPLEN20121790 
SPLEN201 25230 
SPLEN20126110 
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SPLEN20135030//Homo sapiens PDZ-LIM protein mystique mRNA, complete cds 7/3. 30 E-92//1 78aa//97%//AY007729 
SPLEN201 36700 

SPLEN20136730//Homo sapiens RAB-like protein 2A (RABL2A) mRNA, complete cds.//9.30E-41//102aa//90%// 
AF095350 
5 SPLEN201 37530 

SPLEN20138600//NUMB protein [Homo sapiens]7/1.00E-97//102aa//100%//AAD01548 

SPLEN20139100//Human (hybridoma H210) anti-hepatitis A IgG variable region, constant region, complementarity- 
determining regions mRNA, complete cds.//2.30E-227//490aa//85%//M87789 

SPLEN20139360//MAJOR CENTROMERE AUTOANTIGEN B (CENTROMERE PROTEIN B) (CENP-B)7/5.10E-35// 
10 295aa//32%//P071 99 
SPLEN201 75920 

SPLEN20176130//Homo sapiens mRNA for ALEX1 , complete cds.//9.90E-21//161aa/l32%//AB039670 
SPLEN201 77400 

SPLEN201 80980//METHIONYL-TRNA SYNTHETASE (EC 6.1.1.10) (METHIONINE--TRNA LIGASE) (METRS).// 
15 3.40E-57//1 33aa//87%//P561 92 

SPLEN20181570//TRICHOHYALIN7/7.80E-45//832aa//23%//P37709 
SPLEN201 82850 

SPLEN20182990//Mus musculus actin-binding protein (ENC-1) mRNA, complete cds.//1 .90E-49//586aa//26%// 
U65079 

20 SPLEN20183020//BASEMENT MEMBRANE-SPECIFIC HEPARAN SULFATE PROTEOGLYCAN CORE PROTEIN 
PRECURSOR (HSPG) (PERLECAN) (PLC).//1 .20E-24//128aa//46%//Q05793 
SPLEN201 83950 

SPLEN20187490//RESTIN (CYTOPLASMIC LINKER PROTEIN-170 ALPHA-2) (CLIP-170) (REED- STERNBERG IN- 
TERMEDIATE FILAMENT ASSOCIATED PROTEIN) 7/5.1 0E-09//411 aa//22%//P30622 
25 SPLEN201 90080 

SPLEN201 90430//NEURALIZED PROTEIN7/2.80E-09//181 aa//27%//P29503 
SPLEN201 90770 

SPLEN20191020//Homo sapiens MIST mRNA, partial cds.//1 .50E-207//376aa//99%//AB032369 
SPLEN201 92570 
30 SPLEN201 93230 
SPLEN201 93490 
SPLEN201 93750 

SPLEN20193790//INTERFERON-REGULATED RESISTANCE GTP-BINDING PROTEIN MXA 
(INTERFERON- INDUCED PROTEIN P78) (IFI-78K)7/0//572aa//98%//P20591 
35 SPLEN20195710//KINESIN LIGHT CHAIN (KLC)7/5.80E-28//145aa//45%//P46824 
SPLEN201 97090 
SPLEN201 97740 

SPLEN20197930//Rattus norvegicus putative transcription factor LUZP (Luzp) mRNA, complete cds7/6.10E-124// 
275aa//90%//AF1 81 259 
40 SPLEN20198390/yTIPD P ROTE IN 7/2. 80 E-52//307aa//37%//0 15736 
SPLEN201 99850 
SPLEN20200070 
SPLEN20200340 

SPLEN20201 830//BONE/CARTILAGE PROTEOGLYCAN I PRECURSOR (BIGLYCAN) (PG-S1 )7/2.50E-152//283aa// 
45 100%//P21810 

SPLEN20203590 

SPLEN20204670 

SPLEN20205120 

TESOP1 0000350 
50 TESOP1 0001 600 

TESTI1 00001 90//MUCIN 1 PRECURSOR (POLYMORPHIC EPITHELIAL MUCIN) (PEM) (PEMT) (EPISIALIN) (TU- 
MOR-ASSOCIATED MUCIN) (CARCINOMA-ASSOCIATED MUCIN) (TUMOR-ASSOCIATED EPITHELIAL MEM- 
BRANE ANTIGEN) (EMA) (H23AG) (PEANUT- REACTIVE URINARY MUCIN) (PUM) (BREAST CARCINOMA-ASSO- 
CIATED ANTIGEN DF3)7/1.40E-23//667aa//28%//P15941 
55 TEST1 10000850 

TESTI10001570//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2)7/1 .30E-148//598aa//4696//P51523 

TESTI2000431 0//TRICHOHYALIN7/1 .40E-07//247aa//25%//P22793 

TESTI20005980 
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TESTI20006160//CALCIUM-BINDING PROTEIN.//1.10E-11//260aa//28%//P35085 
TESTI20006830//GAR2 PROTEIN. //2.90E-05//221 aa//1 9%//P41 891 

TESTI20012080//TUBULIN-TYROSINE LIGASE (EC 6.3.2.25) (TTL).//2.20E-22//217aa//35%//P38584 
TESTI20012360 
5 TESTI20016970 
TESTI20019590 
TESTI20028020 

TESTI20029100//FIBROSIN (FRAGMENT) 7/1 .60E-1 0//70aa//48%//Q60791 

TESTI20030200//DOUBLE-STRANDED RNA-SPECIFIC EDITASE 1 (EC 3.5.-.-) (DSRNA ADENOSINE DEAMI- 
10 NASE) (RNA EDITING ENZYME 1).//1 .90E-30//192aa//38%//P51400 
TESTI20030440//TRICHOHYALIN.//2.80E-21//412aa//26%//P37709 
TESTI20030610 

TESTI20031310//ALPHA-1 -ANTICHYMOTRYPSIN PRECURSOR (ACT).//7.80E-222//423aa//99%//P01011 
TESTI20031410//SYNAPTONEMAL COMPLEX PROTEIN 1 (SCP-1 PROTEIN).//6.10E-11//449aa//24%//Q03410 
15 TESTI20032770//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG PRECURSOR.//1 .40E-07//1 21 aa//37%// 
P40602 

TESTI20034750 
TESTI20035330 

TESTI20035790//SPLICING FACTOR, ARGININE/SERINE-RICH 4 (PRE-MRNA SPLICING FACTOR SRP75).// 
20 2.30E-44//346aa//37%//Q081 70 

TESTI20038240//Mus musculus neprilysin-like metal lope pti das e 1 (N11) mRNA, complete cds.//0//770aa//77%// 

AF1 76569 

TESTI20040850 

TESTI20041630//Mus musculus mRNA for type II cytokeratin : complete cds.//1 .30E-151//407aa//75%//AB033744 
25 TESTI20043130 

TESTI200431 80//mouse mRNA for megakaryocyte potentiating factor, complete cds.//8 .40 E-06//392aa//24%//D 86370 

TE STI20043220//O RM 1 P ROTE I N 7/6 . 40 E-2 1 //1 38aa//37%//P53224 

TESTI20043910 

TESTI20043990//APOMUCIN (MUCIN CORE PROTEIN) ( FRAGMENT) 7/8. 70E-64//283aa//53%//P 12021 
30 TESTI20044900//Strongylocentrotus purpuratus radial spokehead mRNA, complete cds.//2.50E-150//447aa//61%// 
U73123 

TESTI20045390//Homo sapiens adlican mRNA, complete cds.//1 .80E-1 73//632aa//47%//AF245505 

TESTI20045740 

TESTI20046110 

35 TESTI20046490//Homo sapiens B2 gene partial cDNA, clone B2E7/4.30E-33//284aa//34%//AJ002220 
TESTI20046540 

TESTI20046870//RETINAL-BINDING PROTEIN (RALBP)7/4.00E-06//244aa//23%//P49193 
TESTI20046890//Mus musculus axotrophin mRNA, complete cds7/2.30E-40//173aa//53%//AF1 55739 
TESTI20047370//ATROPHIN-1 (DENTATORUBRAL-PALLIDOLUYSIAN ATROPHY PROTEIN).//1 .90E-22//695aa// 
40 28%//P54258 

TESTI20047930//Homo sapiens NY-REN-2 antigen mRNA, complete cds.//1 .50E-1 91 //530aa//67%//AF1 55095 
TESTI20049060//H. sapiens mRNA for SIRP-beta1 7/2.50E-31//172aa//46%//Y 10376 
TESTI20049410 
TESTI20049990 
45 TESTI20050170 
TESTI20050400 

TESTI20050720//SUCCINYL-COA:3-KETOACID-COENZYME A TRANSFERASE PRECURSOR (EC 2. 8. 3. 5) (SUC- 
CINYL COA:3-OXOACID COA-TRANSFERASE) (OXCT)7/1 .40E-208//51 9aa//74%//P55809 

TESTI20051200//POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE (EC 2.4.1.41) (PROTEIN- UDP 
50 ACETYLGALACTOSAMINYLTRANSFERASE) (UDP-GALNAC: POLYPEPTIDE, N-ACETYLGALACTOSAMINYL- 
TRANSFERASE) (GALNAC-T1 ) 7/6.1 0E-50//1 89aa//49%//Q1 0472 
TESTI20051 730//MALTOSE PERMEASE.//2.20E-05//327aa//23%//Q45632 
TESTI20052670 

TESTI20053070//HYPOTHETICAL WD-REPEAT PROTEIN SLL01 63.//5.00E-06//1 72aa//28%//Q55563 
55 TESTI20053260 
TESTI20053780 
TESTI20053800 
TESTI20053950 
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TESTI20054700//Streptococcus pneumoniae strain g375 surface protein PspC (pspC) gene, pspC-8. 1 aileie, complete 
cds.//6.70E-11//219aa//32%//AF154015 

TESTI20055680//TRANSLATION INITIATION FACTOR IF-2.//5.90E-08//98aa//41%//060841 
TESTI20055880 

s TESTI20056030//Homo sapiens 88-kDa Golgi protein (GM88) mRNA, complete cds.//3.60E-103//316aa//6896// 
AF204231 
TESTI20057200 

TESTI20057430//Z1NC FINGER PROTEIN 8 (ZINC FINGER PROTEIN HF.18) (FRAGMENT).//3.00E-307//543aa// 
99%//P17098 

10 TESTI20057590//TYROSINE-PROTEIN KINASE-LIKE 7 PRECURSOR (COLON CARCINOMA KINASE-4) (CCK-4).// 
7.80E-07//1 52aa//27%//Q1 3308 

TESTI20057840//INHIBITOR OF APOPTOSIS PROTEIN (IAP) (INHIBITOR OF T CELL APOPTOSIS PROTEIN).// 
7.20E-08//86aa//36%//Q90660 

TESTI20057880//CALDESMON (CDM).//6.20E-08//203aa//29%//P1 2957 
15 TESTI20058350//SERINE/THREONINE PROTEIN PHOSPHATASE 2A, 55 KDA REGULATORY SUBUNIT B, GAM- 
MA ISOFORM (PP2A, SUBUNIT B, B-GAMMA ISOFORM) (PP2A, SUBUNIT B, B55-GAMMA ISOFORM) (PP2A, 
SUBUNIT B, PR55-GAMMA ISOFORM) (PP2A, SUBUNIT B, R2-GAMMA ISOFORM).//2.00E-232//426aa//99%// 
P50410 

TESTI20058920//TUBULIN ALPHA-3/ALPHA-7 CHAIN.//3.50E-1 48//277aa//98%//P05214 
20 TESTI20059080//Homo sapiens hyaluronidase (LUCA-3) mRNA, completecds7/2.40E-170//298aa//100%//AF040710 
TESTI20059330 
TESTI20059370 
TESTI20059480 
TESTI20059790 

25 TESTI20059810//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//7.50E-153//536aa//52%//P51523 
TESTI20060080//MYOSIN HEAVY CHAIN, CLONE 203 (FRAGMENT).//5.30E-09//236aa//24%//P 39922 
TESTI20060150 
TESTI20060350 
TESTI20060450 

30 TESTI20060830//Mus musculus mRNA for MIWI (piwi), complete cds.//0//824aa//94%//AB032604 
TESTI20061090 

TESTI20061200//NITRITE EXTRUSION PROTEIN (NITRITE FACILITATOR).//4.90E-05//379aa//23%//P46907 

TESTI20062120//poly(A)-specific ribonuclease (deadenylation nuclease) ; deadenylation nuclease [Homo sapiens]// 

1 .00E-38//144aa//36%//NP_002573 
35 TESTI20062180 

TESTI20062580 

TESTI20063330 

TESTI20063410 

TESTI20063600 
40 TESTI20064370 

TESTI20064530//microtubule-associated protein like echinoderm EMAP [Homo sapiens].//1 .00E-173//562aa//48%// 
XP_009139 

TESTI20064650//Rattus norvegicus myr 6 myosin heavy, chain mRNA, complete cds.//0//645aa//91%//U6041 6 
TESTI20064990 

45 TESTI20065650//INNER CENTROMERE PROTEIN (INCENP).//1 .00E-14//273aa//27%//P53352 
TESTI20066150 
TESTI20066170 
TESTI20066280 

TESTI20066330//FIBRONECTIN PRECURSOR.//9.1 0E-12//408aa//22%//Q91740 
50 TESTI20066590 

TESTI20066650//CELL DIVISION CONTROL PROTEIN 25.//2.20E-1 8//21 6aa//32%//P04821 
TESTI20067350 

TESTI20067440//INTRACELLULAR PROTEIN TRANSPORT PROTEIN US01 .//2.90E-11//553aa//20%//P25386 
TESTI20067480//ZINC FINGER PROTEIN 184 ( FRAGMENT) .//8.60E-134//421 aa//50%//Q99676 
55 TESTI20068530 

TESTI20068790//TRANS-ACTING TRANSCRIPTIONAL PROTEIN ICPO (P135 PROTEIN) (IER 2. 9/ER2.6).//7.50E- 

06//240aa//28%//P291 28 

TESTI20068940 
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TESTI20070400//Homo sapiens CTL2 gene.//5.90E-229//694aa//55%//AJ245621 
TESTI20070740 

TESTI20071130//MYOSIN HEAVY CHAIN, CARDIAC MUSCLE ALPHA ISOFORM.//6.30E-09//399aa//23%//Q02566 
TESTI20071630//GLIOMA PATHOGENESIS-RELATED PROTEIN (RTVP-1 PROTEIN).//5.10E-44//203aa//42%// 
5 P48060 

TESTI20073460 

TESTI200752401 /HYPOTHETICAL ZINC FINGER PROTEIN KIAA0961 .//3.20E-145//492aa//56%//Q9Y2G7 
TESTI20076570//Homo sapiens mitogen -activated protein kinase phosphatase x (MKPX) mRNA, complete cds.// 
7.20E-66//1 26aa//1 00%//AF1 6551 9 
10 TESTI20076920 
TESTI20079060 

TESTI20079220//ZINC FINGER PROTEIN 29 (ZFP-29).//2.00E-73//281 aa//49%//Q07230 

TESTI20079980//SEGMENT POLARITY PROTEIN DISHEVELLED HOMOLOG DVL-1 (D1SHEVELLED-1) (DSH HO- 
MOLOG 1 ).//7. 1 0E-21 2//41 3aa//93%//P51 1 41 
15 TESTI20080460 

TESTI20081890//SPA-1 like protein p1294 [Rattus norvegicus]//5.80E-129//385aa//41%//AF026504 
TESTI20083890 

TESTI20084250//OXYSTEROL-BINDING PROTEIN.//3.70E-1 83//561 aa//62%//P1 6258 
TESTI20085670 

20 TESTI20086840//CARTILAGE MATRIX PROTEIN PRECURSOR (MATRILIN-1).//3.40E-09//1 81aa//29%//P05099 

TESTI20088840//ZINC-FINGER PROTEIN RFP (RET FINGER PROTEIN).//2.60E-73//258aa//31%//Q62158 

TESTI20089290 

TESTI20090180 

TESTI20090970 
25 TESTI20091360 

TESTI20092170 

TESTI20093900 

TESTI20094620 

TESTI20095200//HYPOTHETICAL 98.3 KDA PROTEIN B0495.7 IN CHROMOSOME II.//1 .30E~63//328aa//40%// 
30 Q09216 

TESTI20095440//probable membrane protein YOR240w - yeast (Saccharomyces cerevisiae)//1 .00E-25//98aa//55%// 
S67133 

TESTI20095770//NEDD1 PROTEIN (FRAGMENT).//5.80E-297//648aa//85%//P3321 5 
TESTI20095880//HYPOTHETICAL SYM PORTER SLL1374.//3.80E-26//243aa//27%//P74168 
35 TESTI20097270 

TESTI20099350//MYOSIN HEAVY CHAIN, NONMUSCLE TYPE B (CELLULAR MYOSIN HEAVY CHAIN, TYPE B) 
(NMMHC-B).//1.50E-26//566aa//23%//P35580 

TESTI20100090//Homo sapiens endocytic receptor Endo180 (ENDO180) mRNA, complete cds.//5.00E-161//317aa// 
93%//AF1 34838 
40 TESTI201 02390 

TESTI20103690//Columba livia mRNA for 5'-nucleotidase.//2.70E-114//324aa//66%//AJ131243 

TESTI201 04090//TRANSCRIPTIONAL ENHANCER FACTOR TEF-4 (EMBRYONIC TEA DOMAIN-CONTAINING 
FACTOR) (ETF) (ETEF-1) (TEAD-2).//1 .60E-228//450aa//92%//P48301 
TESTI20105130//MYOTUBULARINV/4.30E-95//537aa//38%//O13496 
45 TESTI20105910//AMILORIDE-SENSITIVE SODIUM CHANNEL DELTA-SUBUNIT (EPITHELIAL NA+ CHANNEL 
DELTA SUBUNIT) (DELTA ENAC) (NONVOLTAGE-GATED SODIUM CHANNEL 1 DELTA SUBUNIT) (SCNED) (DEL- 
TA NACH).//0//636aa//97%//P51172 

TESTI20106170//Mus musculus spermatid WD-repeat protein mRNA, complete cds 7/2.20 E-1 67//367aa//83%// 
AF274321 

50 TESTI20106820//PROTEIN KINASE C, ETA TYPE (EC 2.7.1.-) (NPKC-ETA) (PKC-L).//5.40E-53//97aa//100%// 
P24723 

TESTI201 07240//EUKARYOTIC TRANSLATION INITIATION FACTOR 3 SUBUN IT 1 0 (EIF-3 THETA) (EUKARYOTIC 
TRANSLATION INITIATION FACTOR 3 LARGE SUBUNIT) (PNLA-35).//5.60E-07//428aa//22%//Q40554 
TESTI20107320//G1/S-SPECIFIC CYCLIN C-TYPE.//1 20E-05//130aa//26%//P93411 
55 TESTI201 07340 

TESTI20108060//SERINE/THREONINE PROTEIN PHOSPHATASE PP1 -BETA CATALYTIC SUBUNIT (EC 3.1.3.16) 
(PP-1 B).//1 . 1 0E-78//1 45aa//1 00%//P371 40 

TESTI20112540//CALDESMON (CDM).//9.10E-06//203aa//30%//P 12957 
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TESTI20112860//MYOSIN LIGHT CHAIN KINASE (EC 2.7.1.117) (MLCK)7/1 .60E-54//290aa//40%//P25323 

TESTI20113150 

TESTI201 13940 

TESTI20114480//Human 1(3)mbt protein homolog mRNA, complete cds 7/7. 10E-146//582aa//49%//U 89358 
5 TESTI201 16050 

TESTI20116120//Aegilops squarrosa partial GAG56D gene for gamma-gliadin, accession Clae 24.//3.10E-07//93aa// 
40%//AJ389681 
TESTI201 17500 
TESTI201 18460 
10 TESTI201 20500 
TESTI201 20900 

TESTI20121040//PROBABLE PROTEIN PHOSPHATASE 2CT23F11.1 (EC3.1 .3.16) (PP2C)7/8.60E-13//91aa//39%// 
P49596 

TESTI201 21 71 0//HYPOTHETICAL 57.5 KDA PROTEIN IN VMA7-RPS25A INTERGENIC REGION.//5.00E-08// 
15 292aa//26%//P5321 4 

TESTI20122070//Xenopus laevis ER1 mRNA, complete cds 7/1 .80 E-78//341 aa//49%//AF0 15454 
TESTI201 22440 
TESTI201 24440 

TESTI20125280//M-PROTEIN, STRIATED MUSCLE7/3.60E-68//295aa//46%//Q02173 
20 TESTI201 25440 

TESTI20125920//G PROTEIN PATHWAY SUPPRESSOR 1 (GPS1 PROTEIN) (MFH PROTEIN)7/5.20E-199//367aa// 
98%//Q 13098 

TESTI20126280//Mus musculus STAP mRNA for sperm tail associated protein, complete cds.//4.60E-213//769aa// 
57%//A8029919 

25 TESTI20130530//INSU LIN-DEGRADING ENZYME (EC 3.4.24.56) (INSULYSIN) (INSULINASE) (INSULIN PRO- 
TEASE).//! .60 E-237//464aa//94%//P1 4735 

TESTI201 31 440//CARBOXYPEPTIDASE A1 PRECURSOR (EC 3.4.1 7.1 ) 7/2. 80 E-1 07//332aa/%58%//P 15085 
TESTI201 32310 
TESTI201 32680 
30 TESTI201 34010 
TESTI201 34270 

TESTI20134680//CENTROMERIC PROTEIN E (CENP-E PROTElN)7/2.40E-08//796aa//1 9%//Q02224 
TESTI20134970//M.musculus Tenr mRNA for RNA binding protein 7/5. 50 E-265//559aa//88%//X84693 
TESTI20136010/IMYOSIN HEAVY CHAIN, NONMUSCLE (CELLULAR MYOSIN HEAVY CHAIN) (NMMHC)7/1 .10E- 
35 1 1 //438aa//23%//P 1 41 05 

TESTI20140970//V_segment translation product [Homo sapiens] 7/6. 30 E-51//1 01 aa//99%//AAC8021 0 
TESTI201 42480 

TESTI201 42540/IMPV1 7 PROTEIN7/8.00E-62//1 1 6aa//98%//P3921 0 

TESTI201 431 80//CENTROM ERIC PROTEIN E (CENP-E PROTEIN)7/3.30E-09//507aa//22%//Q02224 
40 TESTI20144390//TESTIS-SPECIFIC PROTEIN PBS137/6.40E-76//251aa//63%//Q01755 

TESTI20145780//Mus musculus mRNA for SH2-containing leukocyte protein 657/1 20E-13//91aa//36%//Y17159 
TESTI20148380//TRANSFORMATION-SENSITIVE PROTEIN IEFSSP 3521 7/2.50E-05//193aa//2296//P31 948 
TESTI201 49880 

TESTI20150420//RHO-GTPASE- ACTIVATING PROTEIN 1 (GTPAS E-ACTI VATI NG PROTEIN RHOGAP) (RHO-RE- 
45 LATED SMALL GTPAS E PROTEIN ACTIVATOR) (CDC42 GTPAS E-ACTI VATI NG PROTEIN) (P50-RHOGAP).// 
4.50E-09//1 29aa//34%//Q07960 

TESTI20150920//PROTEIN PHOSPHATASES PP1 REGULATORY SUBUNIT SDS22.//2.60E-16//193aa//34%// 
P22194 

TESTI20151050 
50 TESTI20151800 
TESTI201 52490 

TESTI20153310//LAMIN B3.//6.30E-13//1 04aa//41%//P48680 

TESTI201 54370//H YALURONAN MEDIATED MOTILITY RECEPTOR (INTRACELLULAR HYALURONIC ACID BIND- 
ING PROTEIN) (RECEPTOR FOR HYALURONAN-MEDIATED MOTILITY)7/1 .1 OE-07//300aa//21%//075330 
55 TESTI201 59380 
TESTI20161010 

TESTI20162780//COTE1 PROTEIN7/1 .00E-107//207aa//99%//P81408 

TESTI201 62980//DNA-DAMAGE INDUCIBLE PROTEIN DDI1 7/1 .80E-39//174aa//48%//P40087 
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TESTI20164210//PROTEIN KINASE C-BINDING PROTEIN NELL1 (NEL-LIKE PROTEIN 1) (FRAGMENT) .//1 .60E- 
88//1 63aa//92%//Q92832 
TESTI201 65680 
TESTI201 65990 

s TEST120166290//Homo sapiens NY-REN-50 antigen mRNA, partial cds.//3.00E-223//426aa//93%//AF155112 
TESTI20166670//HOMEOBOX PROTEIN HOX-B1 (HOX-2l).//6.70E-06//224aa//29%//P14653 
TESTI201 67580 

TESTI20168880//BREAKPOINT CLUSTER REGION PROTEIN (EC 2.7.1 .-).//2.1 0E-23//57aa//92%//P 11 274 
TESTI201 69500//HYPOTHETICAL 51.9 KDA PROTEIN C27F1 .04C IN CHROMOSOME I.//9.50E-1 7//428aa//25%// 
10 Q10173 

TESTI20170170//Homo sapiens mRNA for chondroitin-4-su If otransf erase (C4ST gene).//1 .10E-53//277aa//40%// 
AJ269537 
TESTI201 70280 
TESTI201 70690 
15 TESTI201 70890 

TEST1201 71 070//CYCLIC-AMP-DE PENDENT TRANSCRIPTION FACTOR ATF-4 (DNA-BINDING PROTEIN 
TAXREB67) (CYCLIC AMP RESPONSE ELEMENT-BINDING PROTEIN 2) (CREB2).//2.60E-63//136aa//91%// 
P 18848 

TESTI201 73050 
20 TESTI20173110 

TESTI20173960//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//1 .90E-104//335aa//48%// 
Q05481 

TESTI20175370//MYOSIN HEAVY CHAIN, NONMUSCLE (CELLULAR MYOSIN HEAVY CHAIN) (NMMHC).//1 .00E- 
1 1 //290aa//25%//P 1 41 05 

25 TESTI201 76450//thioredoxin interacting factor [Mus musculus].//1 .00E-75//300aa//46%//AAG32665 
TESTI201 79230 
TESTI201 79510 

TESTI20180600//Homo sapiens HOM-TES-85 tumor antigen mRNA, complete cds.//8.10E-106//202aa//99%// 
AF1 24430 
30 TESTI201 82210 

TESTI20182760//AMILORIDE-SENSITIVE SODIUM CHANNEL DELTA-SUBUNIT (EPITHELIAL NA+ CHANNEL 
DELTA SUBUNIT) (DELTA ENAC) (NONVOLTAGE-GATED SODIUM CHANNEL 1 DELTA SUBUNIT) (SCNED) (DEL- 
TA NACH).//5.90E-185//336aa//99%//P511 72 

TESTI20183680//EARLY NODULIN 20 PRECURSOR (N-20).//5.10E-08//127aa//37%//P93329 
35 TESTI201 84280 

TESTI201847501/LAMININ ALPHA- 1 CHAIN PRECURSOR (LAMININ A CHAIN).//2.30E-200//377aa//99%//P25391 
TESTI20184760//ZINC FINGER PROTEIN 29 (ZFP-29).//9.70E-74//281aa//49%//Q07230 
TESTI201 84820 
TESTI20186110 
40 TESTI201 92570 

TESTI201 93080//GAR22 PROTEIN.//5.1 0E-1 53//31 3aa//92%//Q99501 
TESTI201 93520 

TESTI20194880//PERIOD CIRCADIAN PROTEIN 1 (CIRCADIAN PACEMAKER PROTEIN RIGUI) (HPER).//1 .50E- 
1 1 //288aa//30%//01 5534 

45 TESTI20196690//Bos taurus pyruvate dehydrogenase phosphatase regulatory subunit precursor, mRNA, complete 
cds .//9 . 1 0E- 1 1 4//224aa//92%//AF026954 

TESTI20196970//THIMET OLIGOPEPTIDASE (EC 3.4.24.15) (ENDOPEPTIDASE 24.15) (M P78) 7/4.00 E- 106// 
1 47aa//1 00%//P52888 
TESTI20 197030 
50 TESTI20 197290 

TESTI201 97600//TRANSCRI PTIONAL REPRESSOR CTCF.//5.30E-1 20//271 aa//82%//P4971 1 
TESTI201 98540 
TESTI201 98600 

TESTI20199110//disintegrin-like testicular metalloproteinase (EC 3.4.24.-) IVb - crab-eating macaque (fragment) // 
55 1 .00E-1 67//331 aa//84%//l65253 
TESTI201 99980 
TESTI20200120 
TESTI20200840 
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TESTI20201760//SYNAPTONEMAL COMPLEX PROTEIN 1 (SCP-1 PROTEIN).//4.70E-09//1 89 aa//25%//Q 15431 

TESTI20202830 

TESTI20204260 

TESTI20205100//TRICHOHYALINV/2.60E-15//343aa//24%//P37709 
5 TESTI20205150 

TESTI20205250//phosphatidylinositol-4-phosphate 5-kinase homolog T3K9.2 - Arabidopsis thaliana //4.20E-21// 
194aa//33%//T02098 

TESTI20207170//Human testis-specific protein (TSPY) mRNA, complete cds.//6.60E-111//231aa//96%//U58096 
TESTI20209050//HYPOTHETICAL 113.1 KDA PROTEIN IN PRES-FET4 INTERGENIC REGION.//1 .90E-05//462aa// 
10 22%//Q04893 

TESTI202 10030 

TESTI20210570//RETINAL-BINDING PROTEIN (RALBP).//5.00E-53//327aa//36%//P49193 
TESTI20211380 

TESTI20212970//PUTATIVE ATP-DEPENDENT RNA HELICASE YIR002C.//5.90E-85//458aa//32%//P40562 
15 TEST1 2021 4630 

TESTI20215310//Homo sapiens calcyclin binding protein mRNA, complete cds.//6.60E-95//182aa//100%//AF057356 
TESTI20219110//TYROSINE-PROTEINKINASE-LIKE 7 PRECURSOR (COLON CARCINOMA KINASE-4) (CCK-4).// 
4.00E-23//53aa//1 00%//Q13308 
TESTI20219390 

20 TESTI20220230//Bos taurus Reissner's fiber glycoprotein I mRNA, partial cds.//1 20E-10//77aa//50%//AF078930 
TESTI20221790 

TESTI20222030//Homo sapiens very long-chain acyl-CoA synthetase (BG1) mRNA, complete cds.//6.10E-172// 
643aa//50%//AF1 79481 

TESTI20222460//DYNEIN GAMMA CHAIN, FLAGELLAR OUTER ARM.//1 50E-138//589aa//46%//Q39575 
25 TESTI20223380 

TESTI20226520//TUBULIN-TYROSINE LIGASE (EC 6.3.2.25) (TTL).//6.70E-06//1 64aa//30%//P38160 
TESTI20227380//DEAD BOX PROTEIN 4 (VASA HOMOLOG) (RVLG).//1 10E-263//577aa//86%//Q64060 
TE STI 20228 120//RHO-GTPASE- ACTIVATING PROTEIN 6 (RHO-TYPE GTPASE- ACTIVATING PROTEIN RHOG- 
APX-1) (FRAGMENT).//4.50E-12//164aa//34%//054834 
30 TESTI20228740//ZINC FINGER PROTEIN 135 7/3.50 E-25//132aa//43%//P52742 

TESTI20244220//MYOSIN HEAVY CHAIN IB (MYOSIN HEAVY CHAIN IL).//6.50E-11//77aa//42%//P1 9706 
TESTI20244430//ANKYRIN 1 (ERYTHROCYTE ANKYRIN) (ANKYRIN R) (ANKYRINS 2.1 AND 2.2).//3.70E-15// 
173aa//35%//P16157 

TESTI20244460//ADENYLATE KINASE, CHLOROPLAST (EC 2.7.4.3) (ATP-AMP TRANSPHOSPHORYLASE).// 
35 3.60E-34//209aa//37%//F431 88 

TESTI20244730//Mus musculus alpha/beta hydrolase-1 mRNA, complete cds.//8.20E-113//266aa//81%//AF1 89764 

TESTI20245600//HYPOTHETICAL 118.4 KDA PROTEIN IN BAT2-DAL5 INTERGENIC REGION PRECURSOR.// 

4.50E-05//236aa//27%//P471 79 

TESTI20245860 
40 TESTI20246410 

TESTI20246480//Homo sapiens germline specific RNA binding protein (DAZL1) mRNA, complete cds.//5.00E-22// 
86aa//55%//U66726 

TESTI20247440//Human BLu protein testis isoform (BLu) mRNA, complete cds.//5.40E-45//91 aa//96%//U70880 
TESTI20248850 

45 TESTI20249360//Homo sapiens DEME-6 mRNA, partial cds.//4.70E-94//299aa//56%//AF0071 70 
TESTI20250220//TRICHOHYALIN.//5.40E-54//537aa//30%//P37709 

TESTI20250630//Columba livia mRNA for 5'-nucleotidase.//3.80E-115//328aa//66%//AJ131243 
TESTI20251440//Rattus norvegicus (rsec6) mRNA, complete cds.//3.80E-31//379aa//2896//U32575 
TESTI 20251 61 0//INTER-ALPHA-TRYPSIN INHIBITOR HEAVY CHAIN H2 PRECURSOR (ITI HEAVY CHAIN H2).// 
50 1 .1 0E-07//1 82aa//26%//002668 

TESTI20251740//FYN-BINDING PROTEIN (SLP-76 ASSOCIATED PROTEIN) (SLAP-130).//6.30E-16//88aa//50%// 
015117 

TESTI20252690//SEGMENT POLARITY PROTEIN DISHEVELLED. HOMOLOG DVL-3 (DISHEVELLED-3) (DSH HO- 
MOLOG 3) (KIAA0208).//4.60E-137//304aa//85%//Q92997 
55 TESTI20254030//Homo sapiens actin-binding double-zinc-finger protein (abLIM) mRNA, complete cds.//6.70E-150// 
280aa//96%//AF005654 
TESTI20254090 
TESTI20254480 
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TESTI20254990//ZINC FINGER PROTEIN GLI3 (NEURAL SPECIFIC DNA BINDING PROTEIN XGLI3) (XGLI-3).// 
6.50E-46//1 05aa//75%//Q91 660 

TESTI20255460//Mus musculus mRNA for MIWI (piwi), complete cds.//1 .50E-225//864aa//49%//AB032604 
TESTI20256560//NUF1 PROTEIN (SPINDLE POLY BODY SPACER PROTEIN SPC110).//5.70E-05//590aa//19%// 
5 P32380 

TESTI 202579 10//H LA CLASS I HISTOCOMPATIBILITY ANTIGEN, ALPHA CHAIN G - PRECURSOR (HLA G ANTI- 
GEN) 7/1 .90E-1 22//223aa//1 00%//P1 7693 

TESTI20258720//ANKYRIN 1 (ERYTHROCYTE ANKYRIN) (ANKYRIN R) (ANKYRINS 2.1 AND 2.2).//1 .20E-1 0// 
233aa//27%//P16157 
10 TESTI20259110 
TESTI20259200 
TESTI20260140 

TESTI20260640//helicase II homolog - fruit fly (Drosophila sp.)//2.70E-27//374aa//27%//T13889 
TESTI20261 040//DPY-1 9 PROTEIN.//2.1 0E-47//31 6aa//34%//P3441 3 
15 TESTI20261160//Mus musculus rasGAP -activating- 1 ike protein mRNA, complete cds.//1 .60E-21//129aa//41%// 
AF086714 

TESTI20261680//HEAT SHOCK PROTEIN 30C.//1 .70E-08//136aa//27%//P30218 

TESTI20262150//Rattus norvegicus mRNA for voitage-gated ca channel, complete cds.//0//822aa//87%//AB0 18253 
TESTI20262940//TUBULIN-TYROSINE LIGASE (EC 6.3.2.25) (TTL).//5.70E-10//218aa//25%//P381 60 
20 TESTI20264530//PUTATIVE SERINE/THREONINE-PROTEIN KINASE PKWA (EC 2.7.1 .-.//5.60E-18//21 9aa//34%// 
P49695 

TESTI20264910 
TESTI20265150 
TESTI20265340 
25 TESTI20265890 

TESTI20266050//52 KDA RO PROTEIN (SJOGREN SYNDROME TYPE A ANTIGEN (SS-A)) (RO(SS-A))7/2.1 0E- 
77//472aa//38%//P1 9474 

TESTI20268240//Homo sapiens membrane-associated nucleic acid binding protein mRNA, partial cds.//1 .90E-52// 
41 2aa//36%//AF255303 
30 TESTI20269250 
TESTI20269360 

TESTI20270130//FIBRILLARIN7/2.10E-11//97aa//43%//Q22053 
TESTI20271790//Human p76 mRNA, complete cds.//6.9e-317//496aa//99%//U81006 
TESTI20272380 

35 TESTI20274960//ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1).//7.90E-126//342aa//63%//P51522 
TESTI20277300 

TESTI20278280//Mus musculus p53 apoptosis-associated target (Perp) mRNA, complete cds.//3.40E-84//155aa// 
88%//AF249870 

TESTI20282420//Mus musculus EPCS26 mRNA, complete cds7/2.40E-19//122aa//3396//AF250838 
40 TESTI20282530//ZINC FINGER PROTEIN 1357/1 .60E-56//271aa//39%//P52742 
TESTI20282900 

TESTI20284260//HISTONE H2B F (H2B 291 A) 7/2. 1 0E-22//1 20aa//43%//P1 0853 

TESTI20285230//DOUBLE-STRANDED RNA-SPECIFIC EDITASE 1 (EC 3.5.-.-) (DSRNA ADENOSINE DEAMI- 
NASE)(RNA EDITING ENZYME 1).//2.10E-20//192aa//38%//P51400 
45 TESTI20286590//Human type 3 inositol 1 ,4,5-trisphosphate receptor (ITPR3) mRNA, complete cds7/1 .00E-93// 
1 85aa//1 00%//U01 062 
TESTI20287760 

THYMU10004280//ZINC-FINGER PROTEIN HT2A (72 KDA TAT- INT ER ACTING PROTEIN)7/7.90E-13//87aa//42%// 
Q 13049 

50 THYMU20006020//ISOCITRATE DEHYDROGENASE [NAD] SUBUNIT ALPHA, MITOCHONDRIAL PRECURSOR 

(EC 1.1.1.41) (ISOCITRIC DEHYDROGENASE) (NAD+-SPECIFIC ICDH)7/2.60E-120//229aa//99%//P50213 

THYMU20007020 

THYMU20007750 

THYMU20008000 
55 THYMU20009460 

THYMU20009500 

THYMU20009710 

THYMU20010180//MOB1 PROTEIN (MPS1 BINDER 1)7/2. 60 E-31//1 36aa//50%//P40484 
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THYMU20010710 
THYMU20012020 
THYMU20012560 

THYMU20013250//LIM DOMAIN KINASE 2 (EC 2.7.1.-) (LIMK-2).//3.50E-213//354aa//99%//P53671 
5 THYMU20013810//Human SEC7 homolog Tic (TIC) mRNA, complete cds.//2.30E-88//138aay/96%//U63127 

THYMU20014430 

THYMU20017270 

THYMU20018250 

THYMU20018390 
W THYMU20019000 

THYMU20019260//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1).//6.80E-49//137aa//68%// 
Q03923 

THYMU20020370 

THYMU20020800//LMBR1 long form [Mus musculus].//3.00E-69//198aa//55%//AAF91 092 
is THYMU20021090//Homo sapiens Sex comb on mldleg homolog 1 isoform 1 (SCMH1) mRNA, complete cds.//4.40E- 
80//1 49aa//63%//AF1 49045 
THYMU20021540 

THYMU20023560//DNA BINDING PROTEIN RFX27/6.70E-25//59aa//94%//P48378 

THYMU20024500//N EURO FILAMENT TRIPLET M PROTEIN (160 KDA NEUROFILAMENT PROTEIN) (NF-M).// 
20 6.20E-06//296aa//22%//P1 6053 
THYMU20025480 

THYMU20026950//Mus musculus ROSA 26 transcription AS ROSA26AS mRNA, complete cds.//2.20E-12//285aa// 
23%//U83176 

THYMU20028150//AXONIN-1 PRECURSOR (AXONAL GLYCOPROTEIN TAG-1) (TRANSIENT AXONAL GLYCO- 
LS PROTEIN 1 ).//2.20E-33//301 aa//31 %//Q02246 

THYMU20028410//Mus musculus Pax transcription activation domain interacting protein PTIP mRNA, complete cds.// 

1 .70E-144//345aa//81%//AF1 04261 

THYMU20029830 

THYMU20030460//Homo sapiens tumor endothelial marker 7 precursor (TEM7) mRNA, complete cds.//2.1 0E-123// 
30 230aa//99%//AF279 1 44 
THYMU20030690 

THYMU20031330//Homo sapiens putative nucleotide binding protein mRNA, complete cds.//1 O0E-18//64aa//82%// 
AF1 18394 

THYMU20032820//ZINC FINGER PROTEIN 1357/1 .40E-82//258aa//55%//P52742 
35 THYMU20034400//26S proteasome subunit p44.5 [Homo sapiens]//8.80E-29//71 aa//91 %//AB0031 02 
THYMU20034790 
THYMU20036500 

THYMU20039320//PUTATIVE SERINE/THREONINE-PROTEIN KINASE PKWA (EC 2.7.1 .-) 7/1 .40 E-09//206aa// 

27%//P49695 
40 THYMU20043440 

THYMU20043560 

THYMU20044100 

THYMU20044520 

THYMU20046350 
45 THYMU20046770 

THYMU20049060//SPLICING FACTOR, ARGININE/SER IN E-RICH 2 (SPLICING FACTOR SC35) (SC-35) (SPLICING 

COMPONENT, 35 KDA) (PR264 PROTEIN).//6.00E-41//11 9aa//76%//P30352 

THYMU20050010 

THYMU20051340 

so THYMU20052460//PHORBOLIN I (FRAGMENTS)7/5.80E-20//111aa//45%//P31941 

THYMU20052830//Homo sapiens mRNA for immunoglobulin lambda heavy chain.//1 .50E-237//477aa//90%//Y14737 

THYMU20054800 

THYMU20055450 

THYMU20055460//ESTERASE D (EC 3. 1. 1. 1).//7.60E-57//107aa//100%//P 10768 
55 THYMU20055740 

THYMU20055760//Mus musculus group IIF secreted phospholipase A2 (Pla2g2f) mRNA, complete cds.//1 .70E-73// 

1 65aa//75%//AF1 66099 

THYMU20058550 
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THYMU20060480 
THYMU20062520 

THYMU20062610//DYNEIN GAMMA CHAIN. FLAGELLAR OUTER ARM.//1 .70E-156//585aa//50%//Q39575 
THYMU20062770//UROMODULIN PRECURSOR (TAMM-HORSFALL URINARY GLYCOPROTEIN) (THP).//4.20E- 
5 22//253aa//28%//P27590 

THYMU20063650//Homo sapiens mRNA for putative ribulose~5-phosphate-epimerase, partial cds.//9.30E-57//1 1 6aa// 

97%//AJ224326 

THYMU20064680 

THYMU20066660//Homo sapiens putative ATP-dependent RNA helicase ROK1 mRNA, complete cdsV/9.20E-153// 
10 361 aa//85%//AF077033 
THYMU20069130 
THYMU20069460 
THYMU20069650 

THYMU20070250//TRANSKETOLASE (EC 2.2.1.1) (TK).//4.50E-288//513aa//96%//P29401 
15 THYMU20071120//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1)V/1 .40E-229//536aa//74%// 
Q03923 

THYMU20071460 
THYMU20072580 
THYMU20073070 
20 THYMU20073080 

THYMU20077250//T-CELL-SPECIFIC TRANSCRIPTION FACTOR 1 (TCF-1) (T-CELL FACTOR 1) (TRANSCRIP- 
TION FACTOR-7).//4. 1 0E-90//1 76aa//96%//Q0041 7 
THYMU20078020 
THYMU20078240 
25 THYMU20079690 
THYMU20080490 

THYMU20081110//PINCH PROTEIN (PARTICULARY INTERESTING NEW CYS-HIS PROTEIN).//4.00E-28//60aa// 
86%//P48059 
THYMU20083390 
30 THYMU20083500 

THYMU20083830//Homo sapiens angiostatin binding protein 1 mRNA, complete cds.//2.30E-09//230aa//28%// 

AF286598 

THYMU20084520 

THYMU20086430 

35 THYMU20087270//POTENTIAL PHOSPHOLIPID-TRANSPORTING ATPASE IIB (EC 3.6.1 .-.//7. 20 E-154//235aa// 
88%//P98195 
THYMU20089170 
THYMU20089900 

THYMU20090230//Homo sapiens ribonucleoprotein mRNA, complete cds.//9.30E-73//133aa//100%//L32610 
40 THYMU20091040 

THYMU20095920//Homo sapiens nuclear prelamin A recognition factor mRNA, complete cds.//2.90E-94//178aa// 
100%//AF1 28406 

THYMU20096580//SERYL-TRNA SYNTHETASE (EC 6.1.1.11) (SERINE--TRNA LIGASE) (SEARS).//9.90E-20// 
45aa//97%//P49591 

45 THYMU20097920//mitogen inducible gene mig-2 - human//2.50E-1 97//477aa//55%//S69890 

THYMU20098350//KERATIN, TYPE II CYTOSKELETAL 5 (CYTOKERATIN 5) (K5) (CK 5) (58 KDA CYTO KERATIN).// 
3.40E-267//577aa//89%//P1 3647 

THYMU20099060//IG ALPHA- 1 CHAIN C REGION7/1 .70E-1 96//353aa//99%//P01876 

THYMU201 00940//POTENTIAL PHOSPHOLIPID-TRANSPORTING ATPASE IR (EC 3.6.1.-) (FRAGMENT).//©// 
50 646aa//99%//Q9Y2G3 

THYMU20104480//TRICHOHYALINV/2.90E-21//300aa//28%//P37709 

THYMU20106990//Mus musculus evectin-2 (Evt2) mRNA, complete cds.//2.20E-112//222aa//90%//AF189817 
THYMU201 1 0720//P ROB ABLE AMINOTRANSFERASE T01 B11 .2 (EC 2.6.1 .-).//6.90E-40//2 11 aa//41 %//P9 1408 
THYMU20112570 

55 THYMU20112590//Homo sapiens AP-4 adaptor complex beta4 subunit mRNA, complete cds.//3.40E-306//534aa// 
99%//AF092094 
THYMU20115380 
THYMU201 15730 
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THYMU201 17850 

THYMU20120240//Arabidopsis thaliana ubiquitin-specific protease (AtUBP4) imRNA, complete cds.//4.10E-1 8// 
1 79aa//28%//U76846 

THYMU201 20730//ALDEHYDE DEHYDROGENASE, DIMERIC NAD P-P REFER RING (EC 1.2.1.5) (ALDH CLASS 
5 3)7/2.1 0E-203//379aa//99%//P30838 

THYMU20121040//ELONGATION FACTOR 1 -DELTA (EF-1 -DELTA) 7/5. 30E-1 49//281 aa//99%//P29692 

THYMU20128910 

THYMU201 29020 

THYMU201 30470 
w THYMU201 34260 

THYMU20137050 

THYMU201 37570 

THYMU20139160 

THYMU201 40510 

15 THYMU20143230//Homo sapiens mRNA for stabilin-1 (stabl gene)7/1 .70E-177//317aa//99%//AJ275213 
THYMU20145990//nesca protein [Homo sapiens]7/1 .90E-152//282aa//98%//NP_055143 
THYMU201 48010 
THYMU201 49230 
THYMU20150190 

20 THYMU20151610//Homo sapiens antigen NY-CO-1 (NY-CO-1) mRNA, complete cds.//1 .40E-181//344aa//100%// 
AF039687 

THYMU20153210//Homo sapiens Diff33 protein homolog mRNA, complete cds.//4.00E-120//404aa//54%//AF1 64794 
THYMU201 54790 
THYMU201 57620 
25 THYMU201 63600 

THYMU20170080//Homo sapiens SIT protein. //9.50E-48%/78aa//98%//AJ01 0059 

THYMU20170230//Homo sapiens sarcosine dehydrogenase (SARDH) mRNA, complete cds.//1 .80E-183//260aa// 
99%//AF095735 
THYMU20171580 
30 THYMU201 74490 
THYMU201 74790 
THYMU201 75260 

THYMU20176010//VEGETATIBLE INCOMPATIBILITY PROTEIN HET-E-1 7/1 .80E-20//256aa//28%//Q00808 
THYMU201 77070 

35 THYMU20178440//Homo sapiens mRNA for immunoglobulin lambda heavy chain.//2.20E-229//479aa//88%//Y14737 
THYMU20181890 
THYMU201 84550 
THYMU201 85470 

THYMU201 85650//DIAPHANOUS PROTEIN HOMOLOG 1 7/1 .30 E-20//85aa//44%//0606 1 0 
40 THYMU201 87210 

THYMU20191970//Homo sapiens FLAMINGO 1 mRNA, partial cds 7/1 .80 E-54//450aa//36%//AF234887 
TKIDN10000620//Homo sapiens Tax interaction protein 2 mRNA, partial cds7/1 .20E-56//114aa//100%//AF028824 
TKIDN10001710 

TKIDN10001920//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1).//3.20E-97//226aa//73%// 
45 Q03923 

TRACH2001 1 01 0//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//4.30E-1 7//593aa//22%//P08640 
TRACH20011540//TUMOR-ASSOCIATED ANTIGEN L67/4.70E-57//113aa//97%//P30408 
TRACH20012490 
50 TRACH20021 000 

TRACH20021 380//ADENYLATE CYCLASE, TYPE V (EC 4.6.1 .1) (ATP PYROPHOSPHATE- LYASE) (CA(2+)-INHIB- 
ITABLE ADENYLYL CYCLASE)7/1 .20E-276//492aa//95%//P30803 
TRACH20025370 
TRACH20026640 
55 TRACH20029880 

TR ACH20040390//M ATE R N AL PUMILIO PROTEIN.//1 .20E-177//812aa//47%//P25822 
TRACH20041 090 

TRACH20043360//PUTATIVE KINESIN-LIKE PROTEIN C2F12.13.//2.30E-55//333aa//39%//014343 
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TRACH20044990 
TRACH20049500 
TRACH20051 590 
TRACH20057200 
5 TRACH20058000 
TRACH20073990 
TRACH20080810 
TRACH20081270 

TR ACH20090060//SYNAPTOTAGM I N IV.//1 .10E-12//301aa//25%//P40749 
10 TRACH20091 070//ALDEHYDE DEHYDROGENASE, DIMERIC NADP-PREFERRING (EC 1.2.1.5) (ALDH CLASS 
3).//1 .00E-173//328aa//98%//P3O838 

TRACH20093400//TR1CHOHYALIN.//2.30E-1 3//701 aa//20%//P37709 
TRACH20093480 

TRACH20098510//MYOSIN HEAVY CHAIN, GIZZARD SMOOTH MUSCLE.//1 . 1 0E-20//640aa//21 %//P1 0587 
15 TRACH20101590 
TRACH201 04510 

TRACH20108240//ribonucleoprotein - African clawed frog//4.10E-118//223aa//96%//S40774 

TRACH201 1 3020//SELENIDE, WATER DIKINASE 2 (EC 2.7.9.3) (SELENOPHOSPHATE SYNTHETASE 2) (SELE- 
NIUM DONOR PROTEIN 2)7/1 .80E-207//364aa//96%//Q99611 
20 TRACH20122980//HYPOTHETICAL PROTEIN MJ0798.//6.80E-13//211aa//22%//Q58208 
TRACH201 23870 
TRACH20124970 
TRACH201 25620 
TRACH20129180 

25 TRACH20131230//Homo sapiens oxysterol binding protein-related protein 3 (ORP3) mRNA, complete cds.//7.50E- 
282//608aa//62%//AY008372 
TRACH201 39280 
TRACH20140180 

TRACH20143710//RAB GERANYLGERANYLTRANSFERASE ALPHA SUBUNIT (EC 2.5.1 .-) (RAB GERANYL-GER- 
30 ANYLTRANSFERASE ALPHA SUBUNIT) (RAB GG TRANSFERASE) (RAB GGTASE).//9.70E-07//142aa//33%// 
Q92696 

TRACH20149500//KERATIN, TYPE I CUTICULAR HA6 (HAIR KERATIN, TYPE I HA6).//1 .10E-62//215aa//60%// 
076013 

TRACH201 49720 

35 TR ACH201 49740//EXCITATO RY AMINO ACID TRANSPORTER 5 (RETINAL GLUTAMATE TRANSPORTER).// 
5.00E-76//1 52aa//98%//000341 
TRACH201 58240 
TRACH201 59390 
TRACH201 60800 

40 TRACH20163470//Mus musculus putative thymic stromal co-transporter TSCOT mRNA, complete cds.//3.20E-41// 
1 87aa//34%//AF1 481 45 

TRACH201 641 00//RETROVIRUS-RELATED PROTEASE (EC 3.4.23.-) .//1 .40E-32//113aa//56%//P1 0265 
TRACH201 64810 
TRACH201 65330 

45 TRACH20165540//Human alpha-1 type I collagen gene surrounding osteogenesis imperfecta Ol type II deletion.// 
4.00E-05//1 02aa//37%//M1 1 1 62 
TRACH201 67090 

TRACH20170860//IG DELTA CHAIN C REGION.//1 .60E-212//383aa//100%//P01880 
TRACH20173680//Homo sapiens mRNA for LAK-4p, complete cds.//3.50E-80//41 0aa//38%//AB002405 
50 TRACH201 74980 
TRACH201 82780 
TRACH20185120 

TRACH20188350//Homo sapiens mRNA forcentaurin beta27/3.80E-60//204aa//56%//AJ238248 
TRACH20190460//MONO- AND DIACYLGLYCEROL LIPASE PRECURSOR (EC 3.1 .1 .-) (MDGL).//2.60E-11//195aa// 
55 28%//P25234 

UMVEN1 0001 380 

UTERU1 0001 060//ETS-DOMAIN PROTEIN ELK-1 .//1 .40E-39//88aa//93%//P1 941 9 
UTERU1 0001 870 
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UTERU20000230 

UTERU20000950//Homo sapiens PC326 protein (PC326) mRNA, complete cds.//2.80E-55//112aa//1 00%//AF1 50734 

UTERU20011760 

UTERU20013890 

5 UTERU20016580//POTENTIAL TRANSCRIPTIONAL ADAPTOR.//1 .90E-37//323aa//31 %//Q02336 

UTERU20026620//ZINC FINGER PROTEIN 757/7.50E-82//174aa//82%//P51815 

UTERU20027360 

UTERU20029930 

UTERU20031350 
10 UTERU20035770 

UTERU20040150 

UTERU20040370 

UTERU20040390 

UTERU20040730 

15 UTERU20041630//ZINC FINGER PROTEIN 184 (FRAGMENT).//3.30E~75//239aa//44%//Q99676 
UTERU20041970 
UTERU20045200 

UTERU20051790//guanylate klnase-interacting protein 1 Maguin-1, membrane-associated - rat//8.20E-26//267aa// 
33%//T18293 

20 UTERU20064120//MYELOID UPREGULATED PROTEIN.//1 .30E-79//245aa//66%//035682 
UTERU20065470 
UTERU20079240 
UTERU20083020 

UTERU20086530//GLYCODELIN PRECURSOR (GD) (PREGNANCY-ASSOCIATED ENDOMETRIAL ALPHA-2 
25 GLOBULIN) (PEG) (PA EG) (PLACENTAL PROTEIN 14) (PROGESTERONE-ASSOCIATED ENDOMETRIAL PRO- 
TEIN) (PROGESTAGEN-ASSOCIATED ENDOMETRIAL PROTEIN) 7/6.00 E-73/m0aa//99%//P09466 
UTERU20087070//COMPLEMENT C1 R COMPONENT PRECURSOR (EC 3.4.21 .4 1)7/4. 80 E-206//360aa//99%// 
P00736 

UTERU20087850//Homo sapiens mRNA for mucolipidin (ML4 gene).//2.70E-34//184aa//42%//AJ293970 
30 UTERU20089300 

UTERU20089390//Plectonema boryanum kinesin light chain (KLC) gene, complete cds.//3.30E-38//21 6aa//43%// 
U78597 

UTERU20089620//Homo sapiens radical fringe (RFNG) gene, partial cds7/1 .50E-31//65aa//1 00%//AF108139 
UTERU20090940 
35 UTERU20091470 

UTERU20094830//SI NGLE-M IN DED HOMOLOG 2 (SIM TRANSCRIPTION FACTOR) (MS)M).//8.70E-09//427aa// 

25%//Q61079 

UTERU20095100 

UTERU20099040//ZINC TRANSPORTER 2 (ZNT-2)7/9.80E-96//242aa//76%//Q62941 
40 UTERU20099510//ZINC FINGER PROTEIN 135.//2.20E-1 07//346aa//54%//P52742 

UTERU20141150//NUCLEAR FACTOR NF-KAPPA-B P49 SUBUNIT.//2.20E-07//76aa//47%//Q04860 
UTERU201 02260 
UTERU201 03040 
UTERU201 03200 

45 UTERU20104310//DNA-DIRECTED RNA POLYMERASE II 14.4 KDA POLYPEPTIDE (EC 2.7.7.6) (RPB6) 
(RPB14.4).//1 .80E-48//97aa//1 00%//P41584 
UTERU201 06510 
UTERU20121140 

UTERU20122520//MOESIN (MEMBRANE-ORGANIZING EXTENSION SPIKE PROTEIN).//6.00E-37//399aa//28%// 
50 P26042 

UTERU201 25810 

UTERU20127030//LAMININ BETA-2 CHAIN PRECURSOR (S-LAMININ)7/6.80E-175//377aa//92%//P55268 
UTERU20127150 

UTERU20128560//26.4 KDA PROTEIN IN RUVC-ASPS INTERGENIC REGION.//2.60E-1 7//1 20aa//34%//P24237 
55 UTERU20132620//AXONE ME- ASSOCIATED PROTEIN MST101 (2)7/1 .40E-15//231aa//31%//Q08696 

UTERU20134830//pellino (Drosophila) homolog 2 [Homo sapiens] //1 .40E-153//361 aa//729o//NP__067078 
UTERU20139760//soIute carrier family 25 (mitochondrial carrier; peroxisomal membrane protein, 34kD), member 17 
[Homo sapiens]//5.40E-100//203aa//97%//XP_001136 
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UTERU201 40010 
UTERU201 67570 

UTERU20168960//Homo sapiens actin filament associated protein (AFAP) mRNA, complete cds.//2.60E-68//364aa// 
43%//AF 188700 

5 UTERU20169020//HOMEOBOX PROSPERO-LIKE PROTEIN PROX1 (PROX 1)7/1 .30E-54//11 7aa//74%//Q91018 
UTERU201 73030 
UTERU201 76230 

UTERU20177150//Homo Sapiens zinc finger protein dp mRNA, complete cds.//4.60E-10//1 04aa//40%//AF 153201 
UTERU20181270 

10 UTERU20185220//Human mRNA for transcriptional activator hSNF2a, complete cds.//1 .60E-125//246aa//98%// 
D26155 

UTERU20188670//HFM1 PROTEIN.//5.10E-19//234aa//26%//P51979 
UTERU201 88840 

15 

Claims 

1 . A polynucleotide selected from the group consisting of the following (a) to (g) : 

20 (a) a polynucleotide comprising a protein-coding region of the nucleotide sequence of any one of SEQ ID NOs: 

1 to 1970; 

(b) a polynucleotide encoding a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 
1971 to 3940; 

(c) a polynucleotide comprising a nucleotide sequence encoding a polypeptide comprising the amino acid 
25 sequence of any one of SEQ ID NOs: 1 971 to 3940, wherein, in said amino acid sequence, one or more amino 

acids have been substituted, deleted, inserted, and/or added, and wherein said nucleotide sequence encodes 
a polypeptide functionally equivalent to a polypeptide comprising the selected amino acid sequence; 

(d) a polynucleotide hybridizing to a polynucleotide comprising the nucleotide sequence of any one of SEQ 
ID NOs: 1 to 1970, wherein said nucleotide sequence encodes a polypeptide functionally equivalent to a 

30 polypeptide encoded by the selected nucleotide sequence; 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a polypep- 
tide encoded by the polynucleotide according to any one of (a) to (d); 

(f) a polynucleotide comprising a nucleotide sequence having at least 70% identity to the nucleotide sequence 
of any one of SEQ ID NOs: 1 to 1970; and 

35 (g) a polynucleotide comprising a nucleotide sequence having at least 90% identity to the nucleotide sequence 

of any one of SEQ ID NOs: 1 to 1970. 

2. A polypeptide encoded by the polynucleotide of claim 1 , or a partial peptide thereof. 
40 3. An antibody binding to the polypeptide or the peptide of claim 2. 

4. A method for immunologically assaying the polypeptide or the peptide of claim 2, said method comprising the steps 
of contacting the polypeptide or the peptide of claim 2 with the antibody of claim 3, and observing the binding 
between the two. 

45 

5. A vector comprising the polynucleotide of claim 1 . 

6. A transformant carrying the polynucleotide of claim 1 or the vector of claim 5. 

50 7. a transformant carrying the polynucleotide of claim 1 or the vector of claim 5 in an expressible manner. 

8. A method for producing the polypeptide or the peptide of claim 2, said method comprising the steps of culturing 
the transformant of claim 7 and recovering an expression product. 

55 9. An oligonucleotide comprising at least 15 nucleotides, said oligonucleotide comprising a nucleotide sequence 
complementary to the nucleotide sequence of any one of SEQ ID NOs: 1 to 1970 or to a complementary strand 
thereof. 
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10. Use of the oligonucleotide of claim 9 as a primer for synthesizing the polynucleotide of claim 1 . 

11. Use of the oligonucleotide of claim 9 as a probe for detecting the polynucleotide of claim 1 . 

12. An antisense polynucleotide against the polynucleotide of claim 1 or a part thereof. 

13. A method for detecting the polynucleotide of claim 1 , said method comprising the following steps of: 

a) incubating a target polynucleotide with the oligonucleotide of claim 9 under hybridizable conditions, and 

b) detecting hybridization of the target polynucleotide with the oligonucleotide of claim 9. 

14. A database of polynucleotides and/or polypeptides, said database comprising information on at least one of the 
nucleotide sequences of SEQ ID NOs: 1 to 1970 and/or on at least one of the amino acid sequences of SEQ ID 
NOs: 1971 to 3940. 
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Figure 1 
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